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Abstract 



The conventional interpretation of quantum mechanics, though it permits a correspondence 
to classical physics, leaves the exact mechanism of transition unclear. Though this was only 
of philosophical importance throughout the twentieth century, over the past decade new 
technological developments, such as quantum computing, require a more thorough understanding 
of not just the result of quantvim emergence, but also its mechanism. Quantum decoherence theory 
is the model that developed out of necessity to deal with the quantum-classical transition explicitly, 
and without external observers. In this thesis, we present a self-contained and rigorously argued 
full derivation of the master equation for quantum Brownian motion, one of the key results 
in quantum decoherence theory. We accomplish this from a foundational perspective, only 
assuming a few basic axioms of quantum mechanics and deriving their consequences. We 
then consider a physical example of the master equation and show that quantum decoherence 
successfully represents the transition from a quantum to classical system. 
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CHAPTER 

Preface 



THis thesis is designed to serve a dual purpose. First, it is a stand-alone treatment of 
contemporary decoherence theory, accomplishing this mostly within a rigorous framework 
more detailed than is used in typical undergraduate quantum mechanics courses. It assumes no 
prior knowledge of quantum mechanics, although a basic understanding obtained through a 
standard introductory quantum mechanics or modern physics course would be helpful for depth 
of meaning. Although the mathematics used is introduced thoroughly in chapter 1, the linear 
algebra can get quite complicated. Readers who have not had a formal course in linear algebra 
would benefit from having ref. [1] on-hand during some components, especially chapters 2 and 3. 
The bulk of the work specifically related to decoherence is found in the last three chapters, and 
readers familiar with quantum mechanics desiring a better grasp of decoherence theory should 
proceed to the discussion of quantum mechanics in phase-space, found in chapter 4. 

Second, this thesis is an introduction to the rigorous study of the foundations of quantum 
mechanics, and is again stand-alone in this respect. It develops the bulk of quantum mechanics 
from several standard postulates and the invariance of physics under the Galilei group of 
transformations, outlined in sections 2.1 and 3.1, respectively. Readers interested in this part of 
the thesis should study the first three chapters, where many fundamental results of quantum 
mechanics are developed. We now begin with a motivating discussion of quantum decoherence. 

One of the fundamental issues in physics today is the emergence of the familiar macroscopic 
physics that governs everyday objects from the strange, underlying microscopic laws for the 
motion of atoms and molecules. This collection of laws governing small bodies is called quantum 
mechanics, and operates entirely differently than classical Newtonian physics. However, since all 
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macroscopic objects are made from microscopic particles, which obey quantum mechanics, 
there should be some way to link the two worlds: the macro and the micro. The conventional 
interpretation of quantum mechanics answers questions about the transition from classical to 
quantum mechanics, known as quantum emergence, through a special measurement process, 
which is distinct from the other rules of quantum mechanics [2].^ 

However, when this measurement concept is used, problems arise. The most famous of these 
problems is known as Schrodinger's cat, which asks about the nature of measurement through a 
paradox [3]. The problem creates ambiguity about 

1. when a measurement occurs, and 

2. who (or what) performs it. 

When all is said and done, the conventional interpretation leaves a bitter taste in the mouths of 
many physicists; what they want is a theory of quantum measurement that does not function due 
to subjectively defined observation. If no external observers are permitted, how can classical 
mechanics ever emerge from quantum mechanics? The answer is that complex systems, in essence, 
measure themselves, which leads us to decoherence. 

0.1 Decoherence and the Measurement Problem 

Quantum decoherence theory is a quantitative model of how this transition from quantum 
to classical mechanics occurs, which involves systems performing local measurements on 
themselves. More precisely, we divide our universe into two pieces: a simple system component, 
which is treated quantum mechanically, and a complex environmental component, which is treated 
statistically.^ Since the environment is treated statistically, it obeys the rules of classical (statistical) 
mechanics, and we call it a mixture [4] . When the environment is coupled to the system, any 

^In fact, the motion of a system not being measured is considered unitary, and hence reversible, 
whUe the measurement process is conventionally considered discontinuous, and hence irreversible. 
So, not only are they treated separately, but they are considered fundamentally different processes! 

^The words statistical and classical are being tossed around here a bit. What we mean is 
statistical in the thermodynamic sense, for example probability distributions prepared by random 
coin- tosses. These random, statistical distributions are contrasted against quantum states, which 
may appear to be random when observed, but actually carry quantum interference information. 
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Figure 1: A graphical representation of decoherence. Here, the environment, which is treated 
statistically, can be thought of as an information reservoir. It serves to absorb the quantum interference 
properties of the system, making the system appear as a classical, statistically prepared state. 

quantum mechanical information that the system transfers to the environment is effectively lost, 
hence the system becomes a mixture over time, as indicated in figure 1 . 

In the macroscopic world, ordinary forces are huge compared to the subtle effects of quantum 
mechanics, and thus large systems are very difficult to isolate from their environments. Hence, the 
time it takes large objects to turn to mixtures, called the decoherence time, is very short. It is 
important to keep in mind that decoherence is inherently local. That is, if we consider our entire 
universe, the system plus the environment, quantum mechanically, classical effects do not 
emerge. Rather, we need to "focus" on a particular component, and throw away the quantum 
mechanical information having to do with the environment [3]. 

In order to clarify this notion of decoherence, we examine the following unpublished example 
originally devised by Herbert J. Bernstein [5]. To start, consider an electron gun, as shown in figure 
2. Electrons are an example of a two-state system, and as such they possess a quantum-mechanical 
property called spin [6] . As we develop in detail later in section 2.4, the spin of a two-state system 
can be represented as a vector pointing on the unit two-sphere. Further, any possible spin can be 
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Figure 2: A sketch of Bernstein's thought experiment. The electrons with initial random spin are set to a 
certain angle in the xy-plane at the first angular control. A switch determines whether or not an additional 
phase factor is added using a roulette wheel. Then, a Stern-Gerlach analyzer is used to measure the angle of 
electron spin. 

formed as a linear combination of a spin pointing up in the z direction, and a spin pointing down 
in the -z direction.^ 

We suppose that our electron gun fires electrons of random spin, and then we use some angular 
control device to fix the electron's spin to some angle (that we set) in the xy-plane. Then, we use a 
Stern-Gerlach analyzer adjusted to some angle to measure the resulting electron. The Stern-Gerlach 
analyzer measures how close its control angle is to the spins of the electrons in the beam passing 
through it [5]. It reads out a number on a digital display, with 1 corresponding to perfect alignment 
and corresponding to anti-alignment. So far, we can always use the analyzer to measure the 
quantum-mechanical spin of each electron in our beam. We simply turn the analyzer's angular 
control until its digital display reads one, and then read the value of the angular control. Similarly, 
if we were to turn the analyzer's control to the angle opposite from the beam's angle, the display 
would read zero. The fact that these two special angles always exist is fundamental to quantum 
mechanics, resulting from a purely non-classical phenomenon called superposition."' We next 

■'in linear algebra terminology, we call the spin vectors pointing in +z and -z a basis for the 
linear vector space of all possible states. We deal with bases precisely in section 1.1. 

''The precise nature of quantum superposition is rather subtle, and we discuss it at length in 
section 2.4. 
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insert another component into the path of the electron beam. By turning on a switch, we activate a 
second device that adjusts the angle of our beam in the xy-plane by adding 6. The trick is that this 
device is actually attached to a modified roulette wheel, which we spin every time an electron 
passes. The roulette wheel is labeled in radians, and determines the value of 6 [5]. 

We now frantically spin the angular control attached to our analyzer, attempting to find the 
initial angle of our electron beam. However, much to our surprise, the display appears to be stuck 
on 0.5 [5]. This reading turns out to be no mistake, since the angles of the electrons that the 
analyzer is measuring are now randomly distributed (thanks to the randomness of the roulette 
wheel) throughout the xy-plane. No matter how steadfastly we attempt to measure the spin of the 
electrons in our beam, we cannot while the roulette wheel is active. Essentially, the roulette wheel 
is absorbing the spin information of the electrons, as we apparently no longer have access to it. 

This absorption of quantum information is the exact process that the environment performs in 
quantvmi decoherence theory. In both cases, the information is lost due to statistical randomness, 
and forces a quantum system to be classically random as well. The roulette wheel in this simplified 
example, just like the environment in reality, is blocking our access to quantum properties of a 
system. In chapter 6, we return to a more physical example of decoherence using the quantitative 
tools we develop in this thesis. First, we need to discuss the mathematical underpinnings of 
quantum mechanics. 

0.2 NOTATIONAL CONVENTIONS 

Throughout this thesis, we adopt a variety of notational conventions, some more common than 
others. Here, we list them for clarity. 

• The symbol {=) will always be used in the case of a definition. It indicates that the equality 
does not follow from previous work. The {-) sign indicates equality that logically follows 
from previous work. 

• An integral symbol without bounds. 



is a definite integral from -oo to +oo, rather than the antiderivative, unless otherwise noted. 
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Usually, the differential term in an integrand will be grouped with the integral symbol and 
separated by (•). This is standard multiplication, and is only included for notational clarity. 

Vectors are always given in Dirac kets, ( |-) ), operators on abstract vector or Hilbert spaces 
are always given with hats, ( ^ ), linear functionals over vector spaces are given in Dirac bras, 
( (-1 ), and operators on function spaces are given with checks, 

Both partial and total derivatives are given using either standard Leibniz or in a contracted 
form dx, where 




The symbol (<->) is used to denote a special representation of a particular structure. Its 
precise definition is made clear by context. 

The symbol (*) is used to denote the complex conjugate of a complex number. 
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CHAPTER 



1 



Mathematical background 



BEfore we begin our discussion of quantum mechanics, we take this chapter to review the 
mathematical concepts that might be unfamiliar to the average undergraduate physics 
major wishing a more detailed understanding quantum mechanics. We begin with a discussion of 
linear vector spaces and linear operators. We next generalize these basic concepts to product 
spaces, and finally consider spaces of infinite dimension. Quantum mechanics is much more 
abstract than other areas of physics, such as classical mechanics, and so the immediate utility of the 
techniques introduced here is not evident. However, for the treatment in this thesis to be mostly 
self-contained, we proceed slowly and carefully. 

1.1 Linear Vector Spaces 

In this section, we introduce linear vector spaces, which will be the stages for all of our subsequent 
work.^ We begin with the elementary topic of vector spaces [1]. 

^ Well, actually we will work in a triplet of abstract spaces called a rigged Hilbert space, which 
is a special type of linear vector space. However, most textbooks on quantum mechanics, and even 
most physicists, do not bother much with the distinction. We will look at this issue in more detail 
in section 1.4. 
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Definition 1.1 (Vector space). Let F be a field with addition (+) and multiplication (■). A 
set y is a vector space under the operation (®) over F if for all |m) , li;) ,\w) eV and 
a,beF: 

1. \u)®\v) = |z;)® |m). 

2. (|m) ® \v)) ® \w) = \u) ® ® \w)). 

3. There exists |0) e V such that |0> ® \u) = \u). 

4. There exists - \u) eV such that - |m) ® |m) = |0). 

5. a-{b\u)) = {a-b)\u). 

6. (fl + b) \u) = a\u) + b\u). 

7. a{\u) + \v)) - a\u) +a \v). 

8. For the unity of F, 1, 1 |m) = |u). 



If V satisfies the criteria for a vector space, the members \u) e V are called vectors, and the 
members a e F are called scalars. For the purposes of quantum mechanics, the field F we are 
concerned with is almost always C, the field of complex numbers, and V has the usual (EucHdean) 
topology.^ Since the operation (ffi) is by definition interchangeable with the field operation (+), it is 
conventional to use the symbol (+) for both, and we do so henceforth [7]? 



Definition 1.2 (Linear dependence). A collection of vectors {|Ca)}£V6A/ where A is some 
index set, belonging a vector space V over F is linearly dependent if there exists a set 
(««}aeA such that 



given that at least one a, e i= 0. 



This means that, if a set of vectors is linearly dependent, we can express one of the member vectors 



^The fields we refer to here are those from abstract algebra, and should not be confused with 
force fields (such as the electric and magnetic fields) used in physics. Loosely speaking, most of the 
sets of numbers we deal with in physics are algebraic fields, such as the real and complex numbers. 
For more details, see ref [7]. 

^In definition 1.2, we use the notion « e A, which might be foreign to some readers. A is 
considered an index set, or a set of all possible allowed values for a. Then, by a e A, we are letting 
a run over the entire index set. Using this powerful notation, we can treat almost any type of 
general sum or integral. For more information, see ref. [8] 
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in terms of the others. If a set of vectors is not linearly dependent, we call it linearly independent, 
in which case we would not be able to express one of the member vectors in terms of the others [1]. 



Definition 1.3 (Dimension). Consider the vector space V and let !|c)ct}aeA £ V be an 
arbitrary set of linearly independent vectors. Then, if A is alway finite, the dimension of 
V is the maximum number of elements in A. If A is not always finite, then V is said to 
have infinite dimension. 



Definition 1.4 (Basis). Let B = {\v„)}a€A £ V, where V is a vector space over the field F. 
If \Va) and \vp^ when a i= ji are linearly independent and an arbitrary vector \u) e V can 
be written as a linear combination of {VaYs, i.e. 

= (1.2) 

tteA 

with Ca e F, we say {|i'a)}aeA is a basis set or basis for V. 



It follows directly from this definition that, in any vector space with finite dimension 
D, any basis set will have precisely D members. Because quantum mechanics deals with a 
Euclidean vector space over the complex numbers, it is advantageous to precisely define the inner 
product of two vectors within that special case [4]. 



Definition 1.5 (Inner product). Let V be a vector space over the field of complex numbers 
C. Then, g : V x V — > C is an inner product if, for all |w) , \v) ,\w} eV and a, jS e C, 

1. g{\u)Av)) = g{\v),\u)y, 

2. g{\u),a\v) + b\w))=a-g{\u),\v)) + b-g{\u),\iv)), 

3. g{\u),\u)) > Owithg(|M),|M)) = « \u) = 10). 



Although it is not immediately clear, the inner product is closely related to the space of linear 
functionals on V, called the dual space of V and denoted V*. Below, we define these concepts 
precisely and then show their connection through the Riesz representation theorem [4]. 
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Definition 1.6 (Linear functional). A linear functional on a vector space V over C is any 
function F : V ^ C such that for all «, jS e C and for all \u) , \v) e V , 

V{a\u) + h\v))=a-¥i\u)) + h-V{\v)). (1.3) 

We say that the space occupied by the linear functionals on V is the dual space of Y , and 
we denote it by V* . 



We connect the inner product with the dual space V using the Riesz representation theorem [4]. 



Theorem 1.7 (Riesz representation). Let V be a finite-dimensional vector space and V* be 
its dual space. Then, there exists a bijection h:V* defined by /!(F) = |/) for F e V* 
and |/) e V such that F (|m)) = g (|/) , |w)) V |w) e Y, where g is an inner product of V [4]. 



The proof of this theorem is straightforward, but too lengthy for our present discussion, so we will 
reference a simple proof for the interested reader [4]. The consequences of this theorem are quite 
drastic. It is obviously true that the inner product of two vectors, which maps them to a scalar, is a 
linear functional. However, the Riesz theorem asserts that any linear functional can be represented 
as an inner product. This means that every linear fimctional has precisely one object in the dual 
space, corresponding to a vector in the vector space. For this reason, we call the linear functional 
associated with with |m) a dual vector and write it as 

<w| e r, (1.4) 

and we contract our notation for the inner product of two vectors \u) and \v) to 

g(\u),\v)) = (u\v), (1.5) 

a notational convention first established by R A. M. Dirac. The vectors in V are called kets and the 
dual vectors, or linear functionals associated with vectors in V* , are called bras. Hence, when we 
adjoin a bra and a ket, we get a bra-ket or bracket, which is an inner product. Note that by the 
definition of the inner product, we have 

{ii\p) = (^'l") / (1-6) 
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so if we multiply some vector \v) by a (complex) scalar a, the corresponding dual vector is a* {v\. 
When we form dual vectors from vectors, we must always remember to conjugate such scalars. As 
another note, when choosing a basis, we frequently pick it as orthonormal, which we define below 
[1]. 



Definition 1.8 (Orthonormality of a Basis). A basis B for some vector space V is 
orthonormal if any two vectors \(pi^ and \(pj^ in B satisfy 



[1 iff^y 

10 iff?t; 



(1.7) 



For any vector space, we can always find such a basis, so we do not lose any generality by always 
choosing to use one.* 

A useful example that illustrates the use of vectors and dual vectors can be found by 
constraining our vector space to a finite number of dimensions. Working in such a space, we 
represent vectors as column matrices and dual vectors as row matrices [6]. For example, in three 
dimensions we might have 



|ei) ^ 



and 



1 











(1.8) 



(1.9) 



where |ei) and |e2) are the unit vectors from basic physics [9]. Then, the linear functional 



*The process for finding an orthonormal basis is called the Graham- Schmidt algorithm, and 
allows us to construct an orthonormal basis from any basis. For details, see ref. [1]. 
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1 = 



(1.10) 



We represent the inner product as matrix multiplication, so we write 



1 



- 1 




-i 







= 0, 



(1.11) 







which indicates that |ei) and \e2) are orthogonal, as we expect. 

1.2 Linear Operators 

So far, we have looked at two main types of objects in a vector space: vectors and linear functionals. 
In this section, we focus on a third: the linear operator Recall that linear functionals take vectors to 
numbers. Similarly, linear operators are objects that take vectors to other vectors. Formally, this is 
the following definition [10]. 



Definition 1.9 (Linear Operator). Let \u) ,\v) eV be vectors and a, jS be scalars in the 
field associated with V. Then, we say A is a linear operator on V if 



Throughout the rest of this thesis, whenever we discuss an operator on a vector space, 
we will always use a hat to avoid confusion with a scalar In a finite dimensional vector space, as 
indicated previously, we often represent vectors by column matrices and dual vectors by row 



A\v)eV 



(1.12) 



and 



A (a \u) + j3 \v)) = aA \u) + j3A \v) . 



(1.13) 



^Here, notice that to generate the representation for (£2! from \e2), we must take the complex 
conjugate. This is necessary due to the complex symmetry of the inner product established in eqn. 
1.6. 
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matrices. Similarly, we represent operators by square matrices [6]. For example, if 



A *^ 





1 




(1.14) 



then 



A\ei) « 





1 




1 







^ lei) ■ 



(1.15) 



We can also use our formalism to access individual elements of an operator in its matrix 
representation. Working in the three-dimensional standard, orthonormal basis from the example 
above, we specify B as 

B\u)^\v), (1.16) 



where 



|m) = Ml |ei) + U2 k2) + M3 h) 



(1.17) 



and 



= vi |ei) + V2 Yi) + v-i leg) . 



(1.18) 



Then, 



<e;| B \u) = {e,\ B (mj |ei) + [62) + 1/3 ks)) 

3 

= (e,iB2],"yh) 



= W 



Vi, 



(1.19) 



which is just the matrix equation [4] 



3 

J^B{i,j)uj = Vj, 



(1.20) 
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where we made the definition 

B,j = B{i,j) = {ei\B\ej). (1.21) 

We call B{i, j) the matrix element corresponding to the the operator B. Note that the matrix 
elements of an operator depend on our choice of basis set. Using this expression for a matrix 
element, we define the trace of an operator. This definition is very similar to the elementary notion 
of the trace of a matrix as the sum of the elements in the main diagonal.'' 

Definition 1.10 (Trace). Let A be an operator on the vector space V and let B = {\Va}]a€A £ 
V be an orthonormal basis for V. Then, the trace of A is 

Tr(A) = J^{Va\A\v„}. (1.22) 

aeA 



So far, we have defined operators as acting to the right on vectors. However, since the 
Riesz theorem guarantees a bijection between vectors and dual vectors (linear functionals in the 
dual space), we expect operators to also act to the left on dual vectors. To make this concept 
precise, we write a definition. 

Definition 1.11 (Adjoint). Suppose \u) ,\v) eV such that an operator on V, A, follows 

A\u) = \v). (1.23) 

Then, we define the adjoint of A, A^, as 

{u\A^s{v\. (1.24) 



From this definition, it follows that 

{{u\A^\iv)J = {v\w)' 

= {w\A\u), (1.25) 



^Since the individual matrix elements of an operator depend on the basis chosen, it might seem 
as if the trace would vary with basis, as well. However, the trace turns out to be independent of 
basis choice [4]. 
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which is an important result involving the adjoint, and is sometimes even used as its definition. 
This correctly suggests that the adjoint for operators is very similar to the conjugate transpose for 
square matrices, with the two operations equivalent for the matrix representations of finite vector 
spaces/ 

Although the matrix representation of an operator is useful, we need to express operators using 
Dirac's bra-ket notation. To do this, we define the outer product [6]. 



Definition 1.12 (Outer Product). Let |m) ,\v) eV be vectors. We define the outer product 

of |m) and \v) as the operator A such that 

A = \u){v\. (1.26) 



Note that this is clearly linear, and is an operator, as 

( \u) {v\ ) la;) = \u) (v\w) = (v\w) \u) e V (1.27) 

for \u) , \v) , \w) eV, a vector space. Further, if an operator is constructed in such a way, eqn. 1.25 
tells us that its adjoint is 

{\u){v\y = \v){u\. (1.28) 
Self-adjoint opeartors, i.e. operators such that 

A^=A, (1.29) 



are especially important in quantum mechanics. The main properties that make self-adjoint 
operators useful concern their eigenvectors and eigenvalues.** We summarize them formally 
in the following theorem [4]. 



^Many physicists, seeing that linear fimctionals are represented as row matrices and vectors are 
represented as column matrices, will write \v) = {v\^. This is not technically correct, as the formal 
definition 1.11 only defined the adjoint operation for an operator, not a functional. However, 
though it is an abuse of notation, it turns out that nothing breaks as a result [4] . For clarity, we will 
be careful not to use the adjoint in this way. 



^We assume that the reader has seen eigenvalues and eigenvectors. However, if not, see ref. [1] 
or any other linear algebra text for a thorough introduction. 
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Theorem 1.13 (Eigenvectors and Eigenvalues of Self -adjoint Operators). Let A be 
a self-adjoint operator. Then, all its eigenvalues are real and any two eigenvectors 
corresponding to two distinct eigenvalues are orthogonal. 



Proof. Let A\u) = u \u) and A\v) = v \v) so that \u) and \v) are arbitrary (nonzero) eigenvectors of A 
corresponding to the eigenvalues u and v. Then, using eqn. 1.25, we deduce [4] 

ululu) = (m|m|m) 



= {u\A'\uy 
= {u\A\uy 

= (m|m|m)* 




= m*(m|m). (1.30) 

Since \u) + 0, we get u = it , so u is real. Hence, any arbitrary eigenvalue of a self-adjoint operator 
is real. Next, we consider combinations of two eigenvectors. That is, 

= {u\A\v) - {u\A\v) 

= {u\A\v)-{v\A'^\uy 

= {u\A\v) - {v\A\uy 

= {u\v\v) - {v\u\uy 

= {v-u){u\v). (1.31) 

Thus, ifvi^u, (u\vj = 0, so \u) and \v) are orthogonal as claimed. □ 

Now that we have shown this orthogonality of distinct eigenvectors or an operator, we would 
like to claim that these eigenvectors form a basis for the vector space in which the operator 
works. For finite dimensional spaces, this turns out to be the case, although the proof quite 
technical, so we omit it with reference [4] . However, infinite dimensional cases produce problems 
mathematically, hence the eigenvectors of an operator in such a space need not form a basis for 
that space [4]. For the moment, we will proceed anjrway, returning to this issue in section 1.4. 
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Suppose that {|Ca)}aeA is the set of all eigenvectors of the self-adjoint operator A. Since 
eigenvectors are only determinable up to a scaling factor, as long as our vectors are of finite 
magnitude, we may rescale all of these vectors to be an orthonormal set of basis vectors [1]. By our 
assumption, this set forms a basis for our vector space, V. Thus, for any \u) e V, we can write 

|W) = W« \Va) = K> Ua. (1.32) 

a€A a€A 

Noting that, since the basis vectors are orthonormal, 

(Vi\uj = 2^ M« {Vi\va) = Hi, (1.33) 



a€A 



we get 



|m) = 2^ \Va) {ya\u) = \Va) {Va\ 



a€A \aeA 

It follows immediately that 



(1.34) 



Yj\Va){Va\ = l (1.35) 



cteA 



which is called the resolution of the identity. This leads us to a result that allows us to represent 
self -adjoint operators in terms of their eigenvector bases, the spectral theorem [4]. 



Theorem 1.14 (Spectral Theorem). Let A be an operator on the vector space V. Assuming 
that the spectrum of eigenvectors of A, {|Ca)}aeA/ forms a basis for V, A can be expressed 
as 

A = 'Yaa\Va}{Va\, (1.36) 

a€A 

where [aa]a€A are the eigenvalues of A. 



Proof. Let \u) e V be an arbitrary vector. Then, since {|ya)}aeA is a basis for V, we can write 

\u) = Yj M ■ (1-37) 

neA 
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Hence, 

A\U) UaA \Va) = Uatta \Va) ■ (1.38) 

Now, we consider the other side of the equation. We get [4] 



ISeA 



= 'J^Y^aaUji\Va){va\vii) 

a€A fieA 

= ^ UaUa \Va) 



aeA 

= A\u), (1.39) 



where we used the orthonormality of our basis vectors. This holds for arbitrary \u) e V, so [4] 

A = J^aa\Va){Va\, (1.40) 

aeA 

as desired. □ 

Since we assumed that the eigenvectors for any self-adjoint operator formed a basis for the 
operator's space, we may use the spectral theorem to decompose self -adjoint operators into basis 
elements, which we make use of later. 

1.3 The Tensor Product 

So far, we have discussed two types of products in vector spaces: inner and outer. The tensor 
product falls into the same category as the outer product in that it involves arraying all possible 
combinations of two sets, and is sometimes referred to as the cartesian or direct product [7]. We 
formally define the tensor product operation (®) below [6]. 
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Definition 1.15 (Tensor Product). Suppose V and W are two vector spaces spanned by 
the orthonormal bases {|c«)}aeA and { \wp^ j^^^,, respectively. Then, we define the tensor 
product space, or product space, as the space spanned by the basis set 



x) , ly)) : \x) e {\Va)]aeA, \y) e { \wii) }^^^^} 



(1.41) 



and denote the space asV <^W. We call each ordered pair of vectors a tensor product of 
the two vectors and denote it as \x) ® ly). We require 



(8) |yi)) I (|X2) ® |y2))) = {xi\x2) ® (yi|y2) ■ 



(1.42) 



The tensor product is linear in the normal sense, in that it is distributive and can absorb scalar 
constants [6]. Further, we define linear operators on a product space by 



(A (g) \v) (g) \zv) =A\v)®B \w) . 



(1.43) 



The definition for the tensor product is quite abstract, so we now consider a special case in a matrix 
representation for clarity. Consider a a two-dimensional vector space, V, and a three-dimensional 
vector space W. We let the operator 



A ^ 



1 -i 
2 



(1.44) 



act over V, and the operator 



B ^ 



i 


2 


-1 





1 


-2 


2i 


-1 






act over W. Then, operating on arbitrary vectors, we find 



(1.45) 



A\v) 



1 

2 



Vi 
'02 



V\ - IV2 
2V2 



(1.46) 
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i 2 -1 
1 -2 
2i -1 



\ 


ZVi 




izv\ + 2zv2 - 






W2 




W2 - 2W3 


(1.47) 




. ^3 ^ 




2iwi - W2 





The representation of the tensor product as a matrix operation is called the Kronecker product, 
and is formed by nesting matrices from right to left and distributing via standard multiplication 
[6]. We now illustrate it by working our example. 



A\v)®B \w) ^ 



V\ - IV2 
2V2 



iwi + 2w2 - If 3 

W2 - 2W3 

2iwi - W2 



iwi + 2zv2 - 103 

{Vi - iV2) W2 - 2W3 

2iwi - W2 

( 

iwi + 2w2 - IV3 

2V2 W2 - 2W3 

2iwi - W2 
{vi - iv2) {iwi + 2w2 - W3) 

{Vi - iV2) {W2 - 2W3) 

(ci - iv2) {2iwi - W2) 
2v2 {iwi + 2w2 - W3) 

2V2 {W2 - 2W3) 

2v2 {2iwi - W2) 



But by eqn. 1.43, we should be able to first construct the tensor product of the of the operators A 
and B and apply the resulting operator to the tensor product of \v) and \zv). Working this out using 



(1.48) 
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the Kronecker product, we have 



1 -i 
2 



i 2 -1 
1 -2 
2i -1 



i 2-1 i 2 

1-2 1 

2f -1 J [ 2i -1 

z 2 -1 2 2 

1 -2 2 1 

2i -10 2i -1 

\ /V 

/ 2-11 -2i i 

1-20 -i 2i 

2i -1 2 i 

2/ 4 -2 

2 -4 

4z -2 



and 



\v) ® \w) 



Vi 





V\Wi 


( \ 




V2WI 


W\ 










VIW2 


W2 










V2W2 


. ^3 , 












^ V2W3 ^ 
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so 



A ® B {\v} ® \w}) <-> 



i 


2 





1 


2i 


-1 





















-1 1 -2i i 

-2 -f 2f 

2/0 

2i 4 -2 

2 -4 

4i -2 



ViWi 

V2W1 

V\W2 
V2W2 



I \ 

iv\W\ + 2v2W\ - V1W2 + V2W2 - 2iviW3 + iv2iV3 

V1W2 - iV2W2 - 2ViWz + 2iV2W3 
2iViWi + 2V2Wi - V\W2 + iV2W2 
2iV2Wi + ^V2W2 - 2V2W3 
lV2'W2 - 4172^3 
4iV2'Wi - lV2'W2 

{vi - iv2) (iwi + 2w2 - If 3) 

{Vi - iV2) {W2 - 2W3) 

{vi - iv2) {2iwi - W2) 
2v2 (iyo\ + 2w2 - W3) 

2V2 (W2 - 2W3) 

2v2 {2iwi - W2) 
<-> A\v)^B\w), 



(1.51) 



and we confirm that this example follows 

[A^b)\v)^\w) =A\v)»B\w) (1.52) 

when we use the Kronecker product representation for the tensor product. Since the matrix 
representation is very convenient for finite dimensional vector spaces, we frequently use the 
Kronecker product to calculate the tensor product and then shift back to the abstract Dirac notation. 
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So far, we have largely ignored the main complication that arises when we move from a finite 
dimensional space to an infinite one: the spectrum of eigenvectors for a self-adjoint operator is no 
longer guaranteed to form a basis for the space. To deal with this problem, we will have to work in 
a slightly more specific kind of vector space, called a Hilbert space, denoted 'H. A Hilbert space is 
defined below [4]. 



Definition 1.16 (Hilbert Space). Let W be a general linear vector space and suppose 
that y c W is a vector space formed by any finite linear combinations of the basis set 
{\Va)]a€A- That is, if 

n 

I") = Yl, l'^^"' ' ^^-^^^ 
i=\ 

for some finite n, then |m) e V. We say the Hilbert space H formed by completing V 
contains any vector that can be written as 

n 

\u) = lim \ Ua, \v)a_ , (1.54) 



1=1 



provided 



exists and is finite. 



Note that for the vector spaces described in the above definition, the Hilbert space as- 
sociated with them always follows V Q'H QW, and that W = 'H -V holds if (but not only if) W 
has finite dimension. Without spending too much time on the technicalities, there is a generalized 
spectral theorem that applies to spaces very closely related to, but larger than, Hilbert spaces [4] . 
To determine precisely what this space should be, we must first develop a certain subspace of a 
Hilbert space, which we define by including all vectors \u) subject to 

oo 

{u\u) = Y^\u„„fn'" (1.56) 

«=i 

converging for all wi e N. For a Hilbert space, we require a much weaker condition, as we do not 
have the rapidly increasing n'" in each term of the summand. We define this space as Q, and note 
that always Q Q 'H [4]. The ramifications of the extra normalization requirement for a vector to be 
in Q can be thought of as a requirement for an extremely fast decay as n ^ oo. We now define the 
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r 



V 




J 



J 




Figure 1.1: The spaces V QO.Q'H QQ^ QYi . The area shaded blue is the rigged Hilbert space triplet. 

space of interest, called the conjugate space of Q, and written as in terms of its member vectors 
[8]. Any vector \w) belongs to if 



converges for all \u) e Q and {w\ is continuous on Q. Since we noted that for a vector \u) to be in Q, 
it must vanish very quickly at infinity, \w) is not nearly as restricted as a vector in '7Y. Thus, we 
have the triplet 



which is called a rigged Hilbert Space triplet, and is shown in figure 1.1 [4].^ We noted earlier that 
the set of eigenvectors of a self-adjoint operator need not form a basis for that operator's space if 
the space has infinite dimension. This means that the spectral theorem would break down, which 
is what we wish to avoid. Fortunately, a generalized spectral theorem has been proven for rigged 



^The argument used here is rather subtle. If the reader is not clear on the details, it will 
not impair the comprehension of later sections. To thoroughly understand this material, we 
recommend first reading the treatment of normed linear spaces in ref . [8], and then the discussion 
of rigged Hilbert spaces in refs. [4] and [11]. 



oo 




(1.57) 



(1.58) 
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Hilbert space triplets, which states that any self adjoint operator in 'H has eigenvectors in that 
form a basis for "H [4]. Due to this, we will work in a rigged Hilbert space triplet, which we will 
normally denote by the corresponding Hilbert space, 'H. We do this with the understanding that to 
be completely rigorous, it might be necessary to switch between the component sets of the triplet 
on a case-by-case basis. 

Now that we have outlined the space in which we will be working, there is an important special 
case of an infinite dimensional basis that we need to examine. If our basis is continuous, then we 
can convert all of our abstract summation formulas into integral forms, which are used very 
frequently in quantum mechanics, since the two most popular bases (position and momentum) are 
usually continuous. Specifically, suppose we have a continuous, orthonormal basis for a 
rigged Hilbert space "H given by | \<p^ }^^^' where O is a real interval. Then, if we have [4] 

|M) = 2],"<,'>W' I^) = E^<pW' (1-59) 

we find a special case of eqn. 1.6. This is 

(m|i;) = I d(p- u'v^, (1.60) 

where the integral is taken over the real interval O. Similarly, for an operator A, definition 1.10 
becomes [4] 

Tr(A)= 1 d(p-[(p\A\(p), (1.61) 
and for self -adjoint A, theorem 1.14 is 

A= r dcl)-a^\(p){(p\. (1.62) 
Jo 

When working in a continuous basis, these integral forms of the inner product, trace, and spectral 



common form of confusion when first studying quantum mechanics is the abstract notion of 
vectors. In classical mechanics, a vector might point to a particular spot in a physical space. 
However, in quantum mechanics, a vector can have infinite dimensionality, and so can effectively 
point to every point in a configuration space simultaneously, with varying magnitude. For this 
reason, a very clear distinction must be drawn between the vectors used in the formalism of 
quantum mechanics and the everyday vectors used in classical mechanics. 
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theorem will often be more useful in calculations than their abstract sum counterparts, and 
make extensive use of them in chapter 3. 



CHAPTER 



Formal Structure of Quantum Mechanics 



WE now use the mathematical tools developed last chapter to set the stage for quantum 
mechanics. We begin by listing the correspondence rules that tell us how to represent 
physical objects mathematically. Then, we develop the fundamental quanttmi mechanical 
concept of the state and its associated operator. Next, we investigate the treatment of composite 
quantum mechanical systems. Throughout this chapter, we work in discrete bases to simplify 
our calculations and improve clarity. However, following the rigged Hibert space formalism 
developed in section 1 .4, translating the definitions in this section to an infinite-dimensional space 
is straightforward both mathematically and physically. 



2.1 Fundamental Correspondence Rules of Quantum Mechanics 

At the core of the foundation of quantum mechanics are three rules. The first two tell us how to 
represent a physical object and describe its physical properties mathematically, and the third tells 
us how the the object and properties are connected. These three rules permit us to state a 
physical problem mathematically, work the problem mathematically, and then interpret the 
mathematical result physically [4]. 

The first physical object of concern is the state, which completely describes the physical aspects 
of some system [4]. For instance, we might speak of the state of a hydrogen atom, the state of a 
photon, or a state of thermal equilibrium between two thermal baths. 
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Axiom 2.1 (State Operator). We represent each physical state as a unique linear operator 
that is self-adjoint, nonnegative, and of unit trace, which acts on a Rigged Hilbert Space 
'H. We write this operator p and call it the state operator. 



Now that we have introduced the state, we can discuss the physical concepts used to 
describe states. These concepts include momentum, energy, and position, and are collectively 
known as d5mamical variables [4]. 



Axiom 2.2 (Observable). We represent each dynamical variable as a Hermitian linear 
operator acting on a rigged Hilbert space 'H whose eigenvalues represent all possible 
values of the dynamical variable. We write this operator using our hat (") notation, and 
call it an observable. 



We now link the first two axioms with the third [4]. 



Axiom 2.3 (Expectation Value). The expectation value, or average measurement of the 
value of an observable O over infinitely many identically prepared states (called a virtual 



Though we claimed that these three axioms form the fundamental framework of modern 
quantum mechanics, they most likely seem foreign to the reader who has seen undergraduate 
material. In the next section, we work with the state operator and show that, in a special case, the 
formalism following from the correspondence rules outlined above is identical to that used in 
introductory quantum mechanics courses. 

2.2 The State Operator 

In axiom 2.1, we defined p, the state operator. However, the formal definition is very abstract, so in 
this section we investigate some of the properties of the state operator in an attempt to solidify 
its meaning. Physicists divide quantum mechanical states, and thus state operators, into two broad 





(2.1) 
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categories. Any given state is either called pure or impure. Sometimes, impure states are also 
referred to as mixtures or mixed states. We now precisely define a pure state [4]. 



Definition 2.4 (Pure State). A given state is called pure if its corresponding unique state 
operator, p, can be written as 

where [i/^) e is called the state vector in a rigged Hilbert space "H, e 'H* is the 

linear functional corresponding to \\p), and {4'\4') = 1- If a state cannot be so represented, 
it is called impure. 



Although the importance of pure and impure states is not yet evident, we will eventually need 
an efficient method of distinguishing between them. The definition, which is phrased as an 
existence argument, is not well-suited to this purpose. To generate a more useful relationship, 
consider a pure state. We have 

p2 = pp = (\ip) {ip\) (\ip) {^\) = IV') ((V^IV^)) {ip\ = \^) (1) {^\ = {ip\ = p. (2.3) 

Thus, if a state is pure, it necessarily follows [4] 

p2 = p. (2.4) 

Although seemingly a weaker condition, this result turns out to also be sufficient to describe a pure 
state. To show this, we suppose that our state space is discrete and has dimension D} Invoking the 
spectral theorem, theorem 1.14, we write 

D 

where {p„}|^^^ is the spectrum of eigenvalues for p, corresponding to the unit-normed eigenvectors 
of p, 1 1(^„^ I _j- If we consider some 1 < / < D with D e Z and let p = p^, we have 



p\^i) = P'\'^h' (2.6) 



^This is mainly for our convenience. The argument for an infinite-dimensional space is similar, 
but involves the generalized spectral theorem on our rigged Hilbert space. 
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which is 



(2.7) 



so 



Pi = P] 



(2.8) 



or 



Py(l-Py) = 0- 



(2.9) 



Since all of the eigenvalues of p must also follow this relationship, they must all either be one or 
zero. But by axiom 2.1, Tr (p) = 1, so exactly one of the eigenvalues must be one, while all the 
others are zero. Thus, eqn. 2.5 becomes 



where we have taken Evidently, p is a pure state, and we have shown sufficiency [4]. 

At this point, it is logical to inquire about the necessity of the state operator, as opposed to a 
state vector alone. After all, most states treated in introductory quantum mechanics are readily 
represented as state vectors. However, there are many states that are prepared statistically, and so 
cannot be represented as a state vector An example of one of these cases is found in section 2.5. 
These impure states or mixtures turn out to be of the utmost importance when we begin to discuss 
quantum decoherence, the main focus of this thesis [12]. 

We now turn our attention to the properties of pure states, and illustrate that the state vectors 
defining pure state operators behave as expected under our correspondence rules. By axiom 2.3, 
we know that the expectation value of the dynamical variable (observable) A of a state p is 




(2.10) 




(2.11) 



If p is a pure state, then we can write 



p = \^){^V 



(2.12) 



Hence, \ A) becomes 





(2.13) 
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which, by definition 1.10, is [4] 



(A) = Y,{^n\(\li^){^\A)\cp„) 

11=1 
D 

= E(('^«W)(<'^I^W) 



;j=l 

= {4'\M4'), (2.14) 

where we have used definition 1.8 to pick the basis | \(pa'^ } to be orthonormal and contain the 
vector [i/').' This is the standard definition for an expectation value in introductory quantum 
mechanics, which we recover by letting p be pure [2, 13]. 

2.3 Composite Systems 

In order to model complex physical situations, we will often have to consider multiple, non-isolated 
states. To facilitate this, we need to develop a method for calculating the state operator of a 
composite, or combined, quantum system [4]. 



Axiom 2.5 (Composite State). Suppose we had a pure composite system composed of n 
substates, ■ Then, the composite state operator p of this combined system is given 
by 

p = pi®p2®---®p„, (2.15) 
where (®) is the tensor product, given in definition 1.15. 



Note that if p is pure, there exists some characteristic state vector \ip) of p where 

ll/)) = ® |!/'2) ® ■ ■ ■ ® (2.16) 

and each corresponds to p,. As an important notational aside, eqn 2.16 is frequently shortened 
to [6] 

\iP) = \4>liP2.:4>n), (2.17) 

^This works since \ip) is guaranteed to have unit magnitude by definition 2.4. 
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where the tensor products are taken as implicit in the notation. Just as we discussed dynamical 
variables associated with certain states, so can we associate dynamical variables with composite 
systems. In general, an observable of a composite system with n substates is formed by [6] 

d = di®d2®---®dn, (2.18) 

where each (5, is an observable of the fth substate. We have now extended the concepts of state 
and dynamical variable to composite systems, so it is logical to treat an expectation value of a 
composite system. Of course, since a composite system is a state, axiom 2.3 applies, so we have 

{d) = Tr(pd). (2.19) 

However, composite systems afford us opportunities that single systems do not. Namely, just as 
we trace over the degrees of freedom of a system to calculate expectation values on that system, 
we can trace over some of the degrees of freedom of a composite state to focus on a specific 
subsystem.^ We call this operation the partial trace over a composite system, and we define it 
precisely below [6]. 



Definition 2.6 (Partial Trace). Suppose we have an operator 






Q. = Qi®Q2®---®Q.„. 




(2.20) 


The partial trace of Q over Q, is defined by 






Tr, [Q)=Qi®Q2®---® 4-1 ■ Tr (4) ■ Qi+i ® ■ ■ 


■ ®<a„- 


(2.21) 



If the partial trace is applied to a composite system repeatedly such that all but one of 
the subsystem state operators are traced out, the remaining operator is called a reduced state 
operator [6]. 



'Here, a degree of freedom of a state can be thought of as its dimensionality. It is used 
analogously with the notion in a general system in classical mechanics, where the dimensionality 
of a system's configuration space corresponds to the number of degrees of freedom it possesses. 
For more on this, see ref. [14]. 
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Definition 2.7 (Reduced State Operator). Suppose we have a composite system p with n 
subsystems. The reduced state operator for subsystem i is defined by 



p('' = Tri o Tr2 o ■ ■ ■ o Tr,_i o Tr,+i o ■ ■ ■ o Tr„ (p) . 



(2.22) 



The partial trace and reduced state operator turn out to be essential in the analysis of 
composite systems, although that fact is not immediately obvious. To illustrate this, we consider 
some observable Om that acts only on the A:,„th subsystem of a composite system. We choose a basis 
I \(t>k) where each element is formed by the Kronecker product of the basis elements of the 
corresponding subsystems. That is, each basis vector has the form |0;c) = |(/)i<|)2--.</';j)/ where each 
(pi is one of the orthonormal basis vectors of the Zth substate space. Then, from axiom 2.3, we have 



(a„) = Tr(pa,) 

n 

2]^(o,ipa, io,) 

k=l 

{(Ph(pk2-(pk„ I pd,„ \(pk,(pk2-(pk„) ■ 



(2.23) 



ki,k2,...,k„ 



We use the resolution of the identity, eqn. 1.35, to write our expectation value as 



{(Ph(pk2-(pk„\ P 

ki,k2 k, 



\^h^h-^h.){^n'Pi2-4h\ d,„\(pk,(pk2-(pk„) ' (2-24) 



VJlj2,...,/„ 



where \(pj^(pj2.:(pj„'^ corresponds to a basis vector. This becomes 

Y{(pk,<pk2---K\p\(Pn^i2---(pj,){(ph(pi2---(pj.]0„,\(pk,(pk2---^^^^^ (2.25) 

If the observable O acts as identity on all but the mth subsystem, by eqn. 1.43, we have 



Y{(pk,<pk2--4k„\ p\<Pn(pi2---(pi.) {(Pi^0„,\(pk,,) {(pj,...(pj^^^^^^ (2.26) 
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Since our chosen basis is orthonormal, for any non-zero term in the sum, we must have j = k 
(except for and k„,), in which case the final inner produce is unity. Hence, we get 

X {(Ph(pk2---(pk,]p\(ph--4k,„-i(pi„,(pk„,+i---^^^^^ (2.27) 

h,k2,...,kn,jm 

If we apply eqn. 1.43, letting p = pi ® p2 ® ■ ■ ■ ® Pn, we have 

Yj {'^^^ I pi P2 \<i^h) ■ ■ ■ {(pk,„\pm \(pj„) ■ ■ ■ {(pk„\pn \(pk„) {(pj„.\d,„ \(pk„), (2.28) 

k-i,k2,...,k„,im 



or 



Tr (pi) Tr (p2) ■ ■ ■ {ct>t„. \ p,„ |(/);,„) ■ ■ ■ Tr (p„) | d„, |</,,,„) . (2.29) 

km.jm 

Since each trace is just a scalar, we can write 

( \ 

{"t"^- 1 (Pi) Tr (P2) ■ ■ ■ Pm ■ ■ ■ Tr (p„) |<|),„) | d„, |(/),„,) . (2.30) 



kill 



Recognizing the definition 2.7 for the reduced state operator and the resolution of the identity from 
eqn. 1.35, we find [4] 




Due to this remarkable result, we know that the reduced state operator for a particular subsystem 
is enough to tell us about any observable that only depends on the subsystem. Further, we end up 
with a formula for the expectation value of a component observable very similar to axiom 2.3 for 
observables of the full system. 

2.4 Quantum Superposition 

Though we have introduced some of the basic formalism of the state, we are still missing one of the 
key facets of quantum mechanics. This piece is the superposition principle, which, at the time of 
this writing, is one of the core aspects of quantum mechanics that no one fully understands. 
However, due to repeated experimental evidence, we take it as an axiom. 
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Axiom 2.8 (Superposition Principle). Suppose that a system can be in two possible states, 
represented by the state vectors |0) and Then, 



\ip) = a\0) + , 

where a,/3 e C, is also a valid state of the system, provided that |a| + /3 = 1. 



(2.32) 



The superposition principle allows us to create new and intriguing states that we would not have 
access to otherwise. In fact, if we have n linearly independent states of a system, any point on the 
unit n-sphere corresponds to a valid state of the system.^ If we consider a two-state system with an 
orthonormal basis 1 10) , |1) |, the 2-sphere of possible states guaranteed by the superposition 
principle is conveniently visualized imbedded in 3-space. This visualization of a two-state 
system is called the Bloch sphere representation, and is pictured in figure 2.1 [6]. To calculate the 
position of a system in Bloch space, we use the formula 



(2.33) 



where |r) is the 3-vector, 



\r) = n \ei) + r2\ei) + rj les). 



(2.34) 



and a is the vector of Pauli spin matrices. 



\o) = ffv ki) + Oy \e2) + CTz ka) • 



(2.35) 



The Pauli matrices are 



' 1 



1 



(2.36) 



*The reader might wonder why the superposition principle is necessary, after all, we know that 
state vectors exist in a Hilbert space, and Hilbert spaces act linearly. However, we were not 
guaranteed iintil now that any vector of unit norm in Hilbert space represents a valid physical 
situation. The superposition principle gives us this, which allows us great freedom in constructing 
states. 
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Figure 2.1: Two-state systems can be visualized as being vectors on a two-sphere, known in quantum 
physics as the Bloch sphere. The angles cp and are defined in eqn. 2.48 for pure states, and the axes x, y, and 
z are defined in eqn. 2.35 for all states. 



-/ 

1 



(2.37) 



and 



Writing eqn. 2.33 explicitly, we find 



1 



-1 



(2.38) 



To + r3 ri - ir2 



(2.39) 



This is trivially a basis for all two by two matrices, so we can indeed represent any p by eqn. 2.33. 
Further, if we use the fact that Tr (p) = 1, we know 



Tr (p) « (ro + rs) + (j-q - rs) = Itq = 1, 



(2.40) 
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so ro = 1/2- With this constraint in mind, it is conventional to write eqn. 2.33 as [6] 
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Also, since p is self-adjoint, the diagonal entries must all be real, so e M. By the same reasoning. 



n + ir2 = in - iriY- 



(2.42) 



Since ri and r2 are arbitrary, we can choose either of them to be zero, and the resulting equation 
must hold for all values of the other. Hence, fi - r* and r2 = y*2i so both r\ and r2 are real, and \r) is 
a real-valued vector Since |r) is real, we use it as a position vector that tells us the location of the 
system in Bloch space and call it the Bloch vector. If we have a pure state 



U) = a|0) + |S|l), 



(2.43) 



we can express the location of the state in terms of the familiar polar and azimuthal angles of 
polar-spherical coordinates. Taking into account our redefined, conventional |r), eqn. 2.41 is 



1 + (rja) 



1 + n Tr- ir,, 



V ''•■^ + "'1/ 



(2.44) 



We use the polar-spherical coordinate identities for unit vectors 



r,. = sm U cos (p, 



sm y sm G), 



= cos y. 



(2.45) 
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to determme 



1 + r-, j-v - ir,. 



1 + cos 6 sin cos (|) - i sin sin (p 

sin cos (/) + i sin sin (p 1 - cos 

1 + cos sin de~''>' 



sin 06"^" 1 - cos 6 

l[ 2cos2(f) 2sin(f)cos(f)e-'* ^ 

2[ 2sin(f)cos(f)e''^ 2sin2(f) 

cos2(f) sin(f)cos(f)e-'* 
^ sin(f)cos(f)e"^ sin^d) 



If we let a = cos (0/2) and /3 s e"!' sin (0/2), the right side of eqn. 2.33 becomes 



lap a/3* 



Hence, the state vector of the pure state is [6] 



(2.46) 



(2.47) 




We note that the coefficient on |0) is apparently restricted to be real. However, unlike state 
operators, state vectors are not unique; physically identical state vectors may differ by a phase 
factor e'> [4]. 

The notion of superposition also enables us to refine our classification of composite systems. 
Besides distinguishing between pure and impure states, physicists subdivide composite pure 
states into two categories: entangled states and product states. 



Definition 2.9 (Product State). Suppose p is a pure composite quantum system with 
associated state vector If there exist state vectors \(pij and \(p2j such that 

\ip) = \(Pi)®\cP2), (2.49) 
then we call [i/^) a product state. If no such vectors exist, then we say \ip} is entangled. 
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To construct entangled states, we take product states and put them into superposition. In 
illustration of this concept, we consider the following example. 



2.5 Example: The Bell State 

An important example of an entangled state of two two-state systems is called the Bell State. 
Before we define this system, we need to develop some machinery to work with two-state systems. 
We use the orthonormal basis set introduced previously for a single, pure, two-state system, 
1 10) , |1) |, which we represent as column matrices by 



|0> « 



|1) « 



1 

V / 



(2.50) 



In this representation, we define an orthonormal basis for two of these two-state systems as [6] 



j 10) ® 10) , 10) ® ID , ID ® 10) , ID ® ID } = j 100) , |oi) , |io) , |ii) }, 



(2.51) 



which have matrix representations 



100) 



110) « 



101) « 




1 






/ ^ 




|1D« 



(2.52) 
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By the superposition principle, we define the state 
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100) + 111) 
V2 



1 

V2 





1 

V V2 



(2.53) 



which is the Bell state. To check if this state is entangled, we see if we can write [i/'g) = \(pAj ® |(/Jb) 
for some vectors \(pAj arid |(/)b)- As matrices, this equation is 



1 



V V2 












h 







a? 







( \ 



(2.54) 



This is a system of four simultaneous equations, = flif'i, = aib2, = axbi, and ^ = fl2^'2- Since 
= aibi, ai + and bi + 0. Then, since aib2 = 0,b2 = 0. But = ^2^2/ so 172 i= 0, which is a 

contradiction. Hence, and |(^b) do not exist, so [i/^g) is entangled. [6] 

Next, we compute the state operator corresponding to \iI>b)- By definition 2.4, since the Bell 

state is pure by construction, its state operator is 



|00) + |11)\/<00| + <11| 



V2 /\ V2 
|00) (00| + |00) (111 + 111) <00| + 111) (111 



i i 





1 I 



(2.55) 



Even though we constructed the Bell state from a state vector, we will explicitly verify its purity as 
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an example. We find 

/ / 100) <00| + 100) (111 + 111) <00| + 111) (111 \ /|00) (001 + 100) (111 + 111) (001 + 111) (11|\ 

= ( 2 )( 2 ) 

2 (|00) (00| + |00) (111 + 111) (00| + 111) (111) 
4 

(100) (001 + 100) (111 + 111) (001 + 111) (111) 
2 

= p, (2.56) 

which confirms that the Bell state is pure. 

Next, suppose we want to measure some particular facet of the first subsystem. Since the Bell 
state is entangled, we cannot "eyeball" the result, but rather we need to use the reduced state 
machinery we developed in definition 2.7. The reduced state operator for the first subsystem is 



= Tr2(p) 

'100) (001 + 100) (111 + 111) (001 + 111) (111 ^ 



Tr2 

iTr2 (10) (01 ® 10) (01 + ID (01 ® |1) (0| + |0) (1| ® |0) (1| + |1) (1| ® |1) (1|) 

i( 10) (01 • Tr (10) (01) + ID (01 ■ Tr (|1) (0|) + |0) (1| ■ Tr (|0) (1|) + |D dl ■ Tr (|1) (1|) ) 

i(|0)(0|-l + |l)(0|-0 + |0)(l|-0 + |l)(l|-l) 
10) (01 + ID (1| 



I 



(2.57) 



Oddly enough. 



10) (01 + 11) (11 .(1) 

A P ' 



so p^^^ is impure [6]. Surprisingly, a pure composite system does not necessarily contain pure 
subsystems. If we express p^^^ in terms of the Pauli matrices and the identity as in eqn. 2.33, we 
find that the Bloch vector corresponding to p*^^ is \r) = 0. We already noted that in Bloch space, the 
unit two-sphere represents all the possible pure state configurations for a two-state system. 
However, the unit ball represents all state configurations; the impure states have (r\r'^ < 1 [6]. The 
Bell state, with (r^rj = 0, is a special case of a totally mixed or impure state, meaning that the 
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subsystem is entirely statistical (classical). By symmetry if we had traced out the first subsystem 
rather than the second, we find p^^' = p'-^\ so we actually have an entangled state composed of 
totally classical subsystems. 



2.6 Projection Onto a Basis 

So far, we have worked mostly in an abstract Hilbert space, although we have taken brief forays 
into matrix representations of states and observables. In this section, we formalize the notion of a 
representation of an operator in a basis. We are mainly interested in infinite and continuous bases, 
which we use to define a very useful structure [13]. 



Definition 2.10 (Wavefuntion). Suppose that we have an infinite and continuous basis 
for 'H, {x]xeR- Then, for some pure state vector e "H, we form the wavefunction 



defined by 



i/; : R ^ C (2.58) 
i/;(x) s (x\ip) . (2.59) 



We note that if 



\ip) = J^a,\x), (2.60) 
{4>\=Y^al{x\, (2.61) 



A-eR 



SO 



Tr (p) = i^k) i^^) = E •/'* WV'C^)' (2-62) 

.reR xeU 

where we have used the complex symmetry of the inner product given by eqn. 1.6. But since this is 
a sum over a continuous interval, it can be written as an integral. We obtain 

Tr(p) = J dx- ip'{x)ip{x) = Jdx- \ijj{x)f = 1, (2.63) 

as the state operator has unit trace. Since our sum is infinite, it must be that ip{x) decays at infinity 
sufficiently fast such that the integral converges. This special class of functions is known as the set 
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of square-normalizable functions, and is often denoted as L^. Physically, this means that the 
wavefunction must be localized in some sense, so that at extreme distances it is effectively zero. 

Just as we projected a vector into a basis and obtained a fimction, we can project a linear 
operator acting in Hilbert space onto a basis to obtain a linear operator in function space. We 
denote such operator with a check ("), and define it by [13] 



elements, given by eqn. 1.21, of a state operator p in the position basis. If p is pure, then 




(2.64) 



where £) is an operator on a Hilbert space. An interesting application of this considers the matrix 




(2.65) 



Since we previously established that every valid wavefimction must vanish quickly at infinity, it 
follows that sufficiently off-diagonal elements of the state operator must vanish quickly, as well as 
distant points along the diagonal. 



chapter v— 
Dynamics 



Quantum dynamics is the framework that evolves a quantum state forward in time. We 
begin by considering the Galilei group of transformations, under which all non-relativistic 
physics is believed to be invariant. We show that this group leads to the fundamental commutator 
relations that govern quantum dynamics, and then use them do derive the famous Schrodinger 
equation. Finally, we consider the free particle in the position basis. 

3.1 The Galilei Group 

Fundamental to the notion of dynamics is the physical assumption that certain transformations 
will not change the physics of a situation [4]. All known experimental evidence supports this 
assxm\ption, and it seems reasonable mathematically. This set of transformations forms a group, 
called the Poincare group of space translations, time translations, and Lorentz transformations.^ 
However, for our purposes we take v c, so the Poincare group becomes the classical Galilei 
group, which we take as an axiom. For clarity, we assume a pure state in one temporal and one 
spacial dimension, but this treatment can be readily extended to impure states in three-dimensional 
space [16]. 

^The term group here is used in the formal, mathematical sense. We will not dwell on many of 
the subtleties that arise due to this here, and the interested reader is directed to ref. [15]. 
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Axiom 3.1 (Invariance Under the Galilei Group). Let G be the Galilei group, which 
contains elements generated by the composition of the operators 



Se'lpix, t) = \p{x + £, t) 
fe^{x, t) = ^'{X, t + e) 

Li;\p{x, t) = \p{x + et, t), 



(3.1) 



where i/'(x, f), given by definition 2.10, is a function of position and time, and Q represents 
an operator on the space of such functions, as defined in eqn. 2.64. Let g ^ G and let A be 
an observable of the state \\p) with eigenvectors { \(pn') ^rid eigenvalues {fl„}„6R- 
Then, if A = a„ |(^„^ and for all wavefunctions v{x, t), gv = v' , we assert 



and 



(3.2) 



(3.3) 



In essence, eqns. 3.2 and 3.3 refer to the invariance of possible measurement and invariance of 
probable outcome, and thus the invariance of all physics, imder the Galilei group. We now write a 
motivating identity using the Galilei group. Considering the state wavefunction t), we find 
[16] 



L-^T;^LJ,il>{x,t) 



= L-ef-eLefelp{x, t) 

= L-et-eLelp{x, t + £?) 

= L-ef-eip{x + e{t + e),t + e) 

= L-eip{x + e{t + e), t) 

= \p(x + e(t + £)- et, t) 

= ^{x + e^, t) 

= Sg2lp{X,t). 



(3.4) 



We conclude that these transformations do not commute, which will play a major role in the 
dynamics of quantum mechanics. Before we move to a Hilbert space, we need to convert our 
Galilei group into a more useful form. Due to eqn. 3.3, we can make use of Wigner's theorem, 
which guarantees that any GaHlei transformation corresponds to a unitary operator !J on a Hilbert 
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space that obeys^ 

UU* = U^U = i. (3.5) 

Thus, if Q is a unitary representative of a Galilei group member and A is an observable. If we take 
that 

|m')s!J|m) (3.6) 

for all \u) e "TY, we have 

A' \cp'„) = an !</,;) ^ A'U \(P„) = a„U \(P„) , (3.7) 



so 



Lr+A'!j|(/)„) = (lJ+G)fl„ |c/)„) = a„ |(/)„). (3.8) 



Hence, we get 



A \(\>n) - ifA'-a \(Pn) = an \(pn) " \(pn) = 0. (3.9) 

Since this equation holds for all eigenvectors of A, we have [4] 

A - U^A'U = 0^A = U^A'U ^A' = UAU\ (3.10) 

We now take our unitary transformation to be a function of a single parameter, t, subject to 
U{ti + t2) ~ U{t\)U{t2) and U{0) = 1. Then, for small t, we take the Taylor expansion of U about 
t = to get^ 

+ .... (3.11) 



t=0 



^Wigner's theorem is complicated to prove. See ref. [17] for a thorough treatment. 

^We will be making frequent use of the Taylor expansion. Readers unfamiliar with it are 
advised to see ref. [10]. 
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1 = UU'^ 



= 1 + f- 



dUW 

IT 



+ .. 



f=0 



idU dlP\ 



as t ~ and 0(0) = !J+(0) = i. Since i = UU* for all t, it must be that 



(3.12) 



dU dU^ 
'dt ~^ IT 



= 0. 



f=0 



(3.13) 



We now let 



dU 
dt 



f=0 



= iK, 



(3.14) 



which is well-defined so long as K is self-adjoint. We impose the boundary condition !J(0) = 1 to 
find the solution to this first order differential equation. 



U{s) = e 



iKt 



(3.15) 



Since any unitary operator can be represented in this form, we now define the three generating 
operators of the Galilei group. They are [16] 



S,4>{x) = <x| Sv \ip) = {x\ e-"P \ip) 
ftipix) = {x\ft\il^)^{x\e-"''\,p} 
U^{x) = {x\LJ4>) = {x\e''''f\4>), 



(3.16) 



where /, h, and p are self-adjornt, and the particular signs and parameters associated with the 
transformations are matters of convention. 
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3.2 Commutator Relationships 

We next introduce three particular observables. First, the position operator, Q, obeys the eigenvalue 
equation 

Q\x)=x\x), (3.17) 

where |x) is an eigenvector of the position, i.e. a state of definite position. Second, the momentum 
operator, P, follows 

P\p) = p\p). (3.18) 
We require that the expectation values of these operators follow the classical relationship [2] 



(P).-^. (3.19) 

Further, we define the energy operator H, also known as the Hamiltonian, in analogy to the 
classical total energy of a system, which is the kinetic energy P^/{2m) plus some potential energy V. 
It is 

H s J-P^ + V. (3.20) 

First, note that 

so [h,P] = 0. Next, recall that 

\^{t + e)) = f,\4>{t)). (3.22) 
By the definition of the derivative, we have [16] 

rffl'^(^)> = e 

e-'^>'\4;{t))-\^{t)) 
= lim 

fT^o e 



= lim 

fT^O 



(1 - ieh + (-iehf 12 + ...) \ip{t)) - \xp{t)) 



= ]im(-ih\^{t))-eP\m) + -) 

= -ih\m)- (3-23) 
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Following identical logic, we find [16] 



-{^{t)\ = +i{ip{t)\h. (3.24) 



Since |i/'(f)) is pure, we use eqn. 2.14 to write 
d 



|(!/.(f)||Q|i/'(f))+<'/'a)|Q(^|w) 

i{4>{t)\hQ\xjj{t)) - {ip{t)\Qih\m) 
{^{t)\i(hQ-Qh)\m) 

{ip{t)\i[h,Q]\ip{t)), (3.25) 



so 



i^{Q)ii) = {iM)- (3.26) 

Then, by eqn. 3.19, we have 

^ (p) = {i [h, q]) « m\ U> 1^(0) = mi i [h, q] \m) ■ (3.27) 

Since this result holds for arbitrary |i/'(0)/ we get 

-P = i\h,0\, (3.28) 

or [16] 

IqM = i-P. (3.29) 

We next continue working with the position operator to derive a second relation. Recall that 
from eqn. 3.10, a unitary transformation defined by 

\f) = u\\l^) (3.30) 



transforms an operator as 



A' = UAU\ 



(3.31) 



3. Dynamics 



44 



So, if our unitary operator is Sxo - £ we can transform the position operator Q to Q' according 
to 

Q' = S,„QSi = e-«°PQe+«°P. (3.32) 



By our definition of Q, we know* 



Q\x)=x\x) ^ Q\x')=x' \x'). 



(3.33) 



Further, eqn. 3.2 tells us 



Thus, 



Q' \x')=x\x'). 



(3.34) 



(Q' - Q) I^') = {x- x') \x') = {x-{x + xo)) \x') = -xo \x') .5 (3.35) 
Note that this relationship holds for arbitrary xq, and hence for all \x'). This implies [16] 



(3.36) 



Recalling our definition for Q', we have 



(3.37) 



As before, we expand the exponential terms in a Taylor series to obtain 



11=1 



= (1 - ixop + ...) Q (1 + ;xop + ...) 

= Q- ixopQ + ixoQp + ... 

= Q + ixo{Qp-pQ) + ... 

= Q + ixo[Q,p] + ... 

= Q-xq. 



(3.38) 



''This is because |x) and |x') are valid eigenvectors of Q, as the spectrum of allowed positions 
(the eigenvalues for Q) is the entire real line. 

^Note that x' = x + xq, since S.v-„i/^(x) = ip{xQ + x) = i/'(x'). 
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Hence, in the limit as Xq 0, eqn. 3.37 is [16] 



:lQ,p] = -l^[Q,p\=i 



Next, we examine the momentum operator. Taking our imitary operator to be L„ 



If we operate on states of definite momentum, we know 



P|p) =p |p) = mv^p) . 



By direct analogy with the states of definite position above, we find [16] 



P' = e+'^'of pQ-ivof = p - 



mvQ. 



As above, we find the Taylor expansion of the exponentials to obtain 



ES^^ES^ - (i+f.o/+...)p(i-f.o/+...) 



(!=1 



11=1 



= P + ivofP-ivoPf + ... 

= P + ivo{fP-Pf) + ... 

= P + iVo[f,P] + ... 

= P - mvQ. 



In the limit as Cq ^ 0, we have 



■\f,P] = -m^[f,P] 



im. 



It is a convention to define / = mq, in which case we have [16] 
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We now have 



[h,p] = 0, 
[0,1^1 = i-P, 

[q,P] = i. (3.46) 



We make the standard definition for the position, momentum, and energy operators in terms of the 
Gahlei group generators. It is [16] 

QsHq, P = np, H = nh, (3.47) 

where is a proportionahty constant known as Planck's reduced constant, and is experimentally 
determined to be 

n ^ 10"^* joule - seconds (3.48) 



in SI units. Then, eqn. 3.46 reads [16] 



[P,h] = 0, 
\q,h] = m-P, 

[Q,P] = in, (3.49) 



where 




is especially important, and is called the canonical commutator. 

As a consequence of our work so far this chapter, we now are in the position to evolve a state 
operator p in time. From eqn. 3.10, we have 



A' = UAU' 



(3.51) 
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for an arbitrary observable A. Letting A = p , the state operator, and U = Tt - e~'^^^'', we have 

p' = e-""/''pe^''"'\ (3.52) 

Thus, by the definition of the derivative, 

p' - p g-mnp^itH/n _ Q 
d,p = lim = lim . (3.53) 

Expanding the exponential terms in a Taylor series, we get 

, (l-f + -)p(l + f + 
d,p = hm 

iH iH 



,h], (3.54) 



so the equation of motion for the state operator is 




3.3 The Schrodinger wave equation 

Now that we have the commutator relations in eqn. 3.49, we can touch base with elementary 
quantum mechanics by deriving the Schrodinger wave equation. We work in the position basis, 
where our basis vectors follow 

Q\x) = x\x). (3.56) 

Considering some state vector 

\iP) = J^a,\x), (3.57) 
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its wavef unction, given by definition 2.10, is 

l/'(x) = (^x\^^ - {x\ ^ |x) 



= flr 



(3.58) 



Considering Q, we find by eqn. 2.64 that 



Qip{x) = {x\ Q li//) = {x\x\ip) =x (^x\ipj = xip{x). 



(3.59) 



So, in the position basis, Q turns out to be multipHcation by x. Using this result with eqn. 2.14, we 
find [2] 



= (mm 



\Y,al{x\ Q Y,^y\y) 

\xeR ) \y€R 

( )( 
\xeR ) \y€R 

( If 

a'x (x\ ayy \y) 

\xe& ) \y€R 

Yj "--^^"^ 
.veR 

dx ■ i/'(x)*xi/^(x) 



(3.60) 



We would like to find a similar expression for momentum within the position basis. To do this, we 
consider the canonical commutator from eqn. 3.46, [Q/^] = which corresponds to 



(Iq,P] = in) ^ [Q,-P] ipix) = ihipix) 



(3.61) 
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Considering some dummy function f{x), we have [2] 

ndf ndf 

infix) = X-- — X-- — hinf 
1 ax I ax 

ndf ndf n , 

= X-— -X-— - -f 
I dx I dx I 

ndf nd 

1 dx 1 dx 
n d n d . 
i dx i dx ' 

= {QP-PQ)f- 

Since we know Qf = xf in the position basis. 



I dx 



We now drop our test fimction to obtain the famous operator relationship 



i dx 



Now, recall from eqn. 3.22, 



It follows that 



SO 



dt^ptix) = {x\d, \ipt} = {x\- — \\l>,} = -- {x\H\\l>t) = -i^Hiptix). 
But by eqn. 3.20, 

H^x) = {x\ H\xj,t) = {x\ (^P' + v) \iiJt) = ^P%i^) + ^Mx)- 

Hence, we have 
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This is rewritten as 



dlb(x,t) fi2 ^2^(x f) 



and is the time-dependent Schrodinger equation [2]. 

Remarkably, so long as V is time-independent, this equation turns out to be separable, so we 
can effectively pull off the time-dependence. To do this, we suppose [2] 



ip{x,t) sip{x)<p{t), (3.71) 



and substitute into the Schrodinger equation. We have 



which is 



dih(x)(p(t) Jf d^\p(x)(p(t) 



d(p{t) fi2 d^Mx) 
m^ix)^ + ^(Pit)-^ = VipixMt), (3.73) 



or 



(pit) dt ~ Im^pix) dx2 

provided (p{t), \p{x) + 0. We now have two independent, single-variable functions set equal , so we 
know each of the functions must be equal to some constant, which we name £. That is, we have [2] 

where we have let the partial derivatives go to normal derivatives, since we now have 
single-variable functions. The time-dependent piece has the solution 

^{t) = e-'^'/^ (3.76) 



and the time-independent piece is usually written as [2] 
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fe2 

— £l?!/^(x) + Vip{x) = E^{x), (3.77) 



which is the time-independent Schrodinger equation. Although this result cannot be reduced 
further without specifying V, we can use eqn. 3.68 to find 

H^{x) = Eij^ix). (3.78) 

This means that the values for the separation constant E are actually the possible eigenvalues for H, 
the position representation of the Hamiltonian (energy operator). Further, if we find i/'(x), we can 
construct i/'(x, t) by 

^{x, t) = (p{t)4>{x) = e-'^''^'ip{x). (3.79) 

If we compare this to eqn. 3.22, 

\^t) = ft\4>t=o) = e-''"'"\^o), (3.80) 

we find a distinct similarity between the form of time evolution in Hilbert space using the unitary 
Tf operator and time evolution in position space using the complex exponential of the eigenvalues 
of the associated H operator on function space. 

3.4 The Free Particle 

Now that we have derived the Schrodinger equation, we will put it to use by treating the case of 
a free particle, when the potential V = 0. In this case, the time-independent Schrodinger equation 
(eqn. 3.77) reads 

- ^dXx) = E^{x), (3.81) 

which we write as 

d^^(x) = -k^4>{x), (3.82) 



where 



ic.^. (3.83) 
n 
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This equation has a solution [13] 
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(3.84) 



which is sinusoidal with amplitude A. Note that we identified the constants in our equation as k 
with some foresight, as it turns out to be the wave nimiber, k - 2n/A, of the solution. However, 
this solution does not decay at infinity, so the condition imposed by definition 2.4 is violated. That 
is [2], 

( ]( 

\xeR ) \i/€K 

xeV. yeR 

X€lR 

- J dx ■ ijj* {x)ijj{x) 
= \Af J dx 

= oo, (3.85) 



so we cannot pick appropriate A such that (i/'l'/') = 1. Hence, must not be a physically 
realizable state. The resolution to this problem is to use a linear combination of states with 
different values for A. The general formula for this linear combination is [2] 



i/'(x) 



dk ■ (p{ky^\ 



(3.86) 



where (p{k) is the coefficient that replaces A in our linear combination. Each of the component 
states of this integral are called plane waves, while the linear combination is called a wave packet. 
We will make use of the plane wave components for free particles later, so we need to investigate 
their form further. Consider the eigenvalue problem 



Pfp(x) = pfp{x), 



(3.87) 
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where fp is an eigenfunction and p is an eigenvalue of the momentum operator in the position 
basis. Using eqn. 3.63, we write 

-Jjp{x) = p/p(x). (3.88) 

This has a solution [2] 

fp{x) = Ae'P''\ (3.89) 

which is of identical form to eqn. 3.84. If we identify the eigenfunctions of the position operator 
with the plane wave states, we get the famous de Broglie relation [2, 4, 11, 13], 

p = hk. (3.90) 

Recall that plane wave states are not normalizable, and thus cannot be physically realizable states. 
This means that in the position basis, states of definite momentum are not permissible, which is a 
famous consequence of the Heisenberg uncertainty principle. The wave packet, then, can be 
thought of as a superposition of states of definite momentum, giving rise to a state of definite 
position. That is [2], 

ijjix) = Jdp- <p{p)e'P\ (3.91) 
where we have switched to units in which H = 1, as we will do for the remainder of this thesis. 



^The uncertainty principle reads AxAp > h/2 [2]. If we have a state of definite position, Ap = 0, 
so, roughly. Ax = oo. This is the result that we have already seen; states of definite momentum are 
not square-normalizable in the position basis. 



CHAPTER 



The Wigner Distribution 



iHe Wigner distribution was the first quasi-probability distribution used in physics. Invented 



-1- in 1932 by E.P. Wigner, it remains in wide use today in many areas, especially quantum 
mechanics and signal analysis [18]. The Wigner distribution has been used to develop an entirely 
new formalism of quantum mechanics in phase-space, a space of position vs. momentum, which 
we touch on briefly in section 4.5 [19]. 

In the following chapter, we first define the Wigner distribution and derive some of its 
fimdamental properties. Next, we discuss the Wigner distribution of a combined system and treat 
a free particle. Following that, we extend the distribution to its associated transform. We then 
create a table of useful inverse relationships between the state operator and Wigner distribution 
required in subsequent sections. Finally, we construct the Wigner distribution for a simple 
harmonic oscillator as an example and observe its correspondence to a classical phase-space 
probability distribution. 

4.1 Definition and Fundamental Properitees 

In this section, we explore the basic properties of the Wigner distribution, starting with its 
definition, which is stated below [20] 
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Definition 4.1 (The Wigner distribution). Consider the matrix elements of some state 
operator, given by p(x, y) - {x\p\y). Then, the Wigner distribution W associated with p 
is given by 





where time dependence is understood and not written explicitly. 



Note that the Wigner distribution is given by a special case of the Fourier transform of 
the state operator with respect to the mean (x) and difference (6) coordinates.^ Using this definition, 
we now list and verify some of the well known properties of the Wigner distribution. 

4.1.1 Inverse Distribution 

As one might guess, just as the Wigner distribution is defined in terms of the state operator, it is 
possible to define the state operator in terms of the Wigner distribution. This distinction is 
arbitrary: valid formulations of quantum mechanics have been made with the Wigner distribution 
as the primary object, while the state operator takes a secondary seat. However, historically the 
state operator and its associated vector have been the objects of primary importance in the 
development of quantum mechanics [21]. If we wish to express a state operator in terms of an 
associated Wigner distribution, we can make use of the relation [22] 



In order to show that this is well-defined, we note that the Plancherel theorem states [2] 



for some function / and its Fourier transform, !F(/), so long as the functions decay sufficiently fast 
at infinity. From this, is evident that the state operator is a kind of Fourier transform of the 

^ We assume some basic familiarity with the Fourier transform. If this topic is unfamiliar, the 
reader is advised to see ref. [10]. 




(4.3) 




(4.4) 
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Wigner distribution, as we claimed in the previous section, so our inverse relationship is indeed 
appropriate. 

4.1.2 Reality of the Wigner Distribution 

One of the most important of the basic properties we will cover is that the Wigner distribution is 
always real-valued.^ That is. 



In order to show this, we will take the complex conjugate of W{x,p). This gives us [23] 



where we used eqn. 1.25 for the self-adjoint operator p. Since we found W'{x,p) = W{x,p), we 
have W{x,p) e R, as we claimed in eqn. 4.5. 

4.1.3 Marginal Distributions 

Based on our definition of the Wigner distribution, we note two important marginal distributions. 
They are [24] 



Wix,p,t) e ]R. 



(4.5) 




= W(x,p), 



(4.6) 




(4.7) 



^Although it is real-valued, the Wigner distribution is not always positive. It is called a 
quasi-probability distribution since it is analogous to a true probability distribution, but has 
negative regions. We will deal these apparent negative probabilities more in section 4.5. 
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and 




To show these results, we recall the definition of the Wigner distribution. We have 

J dp ■ W{x,p) = J'^P-:^J'^^- ^"'''V is) 
= ^jd6-p{x+^-5,x- Ifi) J dp ■ e-'P^ 
= ^ J d6 ■ p(x + ^5,x - ^dj Indoid) 

= J d6-5D{5)p{x+^6,x-^6j 
= p{x,x) 

= mp\x), (4.9) 
where do is called the Dirac delta/ and has the important properties [10] 

Jdx-6D{y)f{x + y)^f{x) (4.10) 

and 

J dx ■ e-'"J s Indoiy). (4-11) 

In preparation for calculating the postion marginal distribution, it is useful to discuss the 
momentum representation of the state operator. Analogous to the position representation, we 
define 




^The Dirac delta is roughly a sharp spike at a point, and zero elsewhere. Technically, it is not 
quite a function, but it is a very useful construct in theoretical physics. For more information, see 
ref. [10]. 
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where the (~) is used to distinguish position matrix elements from momentum matrix elements. In 
terms of the momentum representation, the Wigner distribution is 

W{x,p) ^ Wp{x,p) ^^JdA- e-^'p {p +\a,P- (4.13) 

where p = and A = p - p' are the average and difference momentum coordinates, in direct 
analogy to x and 6. We are now ready to calculate the position marginal distribution. We have 

J dxW{x,p) J dxWp{x,p) 

= ^jdA-p{p+^-A,p- ^a) J dxe-'^' 

= ^ JdA-p(p + iA,p-iA)27i6D(6) 
= J dA-5D{d)p{p + ^A,p-^A^ 



^ p(p,p) 

= {p\p\p), (4.14) 



which is what we claimed. 



4.2 Wigner distributions of combined systems 

Recall that in section 2.3, we defined the state operator of a composite system as 

Pi+2 = jOi ® pi, (4.15) 

where (®) is the tensor product. In analogy to this, we define the Wigner distribution of a 
composite system to be [22] 



Wi+2{Xl,X2,pi,p2) = Wi(Xi,pi)W2(X2,p2)- 



(4.16) 
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In section 2.3, we also developed the partial trace, which was a method for extracting information 
about a single sub-state operator in a composite state operator. Not surprisingly, we define 
an analogous operation, which effectively annihilates one of the sub- Wigner distributions in 
a composite distribution by integrating out the degrees of freedom of the sub-distribution. 
Formally, we call this the projection fimction Ji : W1+2 Wi and define it as 

^(Wi+2) = JrfX2dp2Wi+2. (4.17) 

To understand how it works, we evaluate it on the initial total Wigner distribution. This is 

= J dX2dp2Wi{Xi,pi)W2{X2,p2) 
= Wi(Xi,pi) J dX2dp2W2{X2,p2) 
= Wi(Xi,pi) J dX2p2{X2,X2) 

= Wi(xi,pi)Tr(p2) 

= Wi(xi,pi), (4.18) 

where we have used eqns. 4.7 and definition 1.10 to integrate W2 and perform the full trace of p2- 
Thus, ^ behaves as desired, in direct analogy to the partial trace on composite state operators. 

4.3 Equation of Motion for a Free Particle 

Now that we have laid out the basic properties of the Wigner distribution, we need to understand 
how to use it to describe a physical system. In this section, we investigate how a Wigner 
distribution evolves in time in the absence of a potential. Recall that in section 3.4, we established 
the Hamiltonian of a free system as 

H=— . (4.19) 

2m 

Given the Hamiltonian, we can calculate the time evolution of the state operator of the system via 
the commutator relation 

dtp = -i[H,p], (4.20) 
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d,p = i (pH -Hp) = ^ (pP' - P'p) , (4.21) 

noting that m is a scalar. So far, we have a general operator equation for the evolution of the 
system. If we want to know more specific information about its motion, we need to choose a basis 
onto which we may project our equation. Choosing momentum, we multiply both sides of the 
equation by {p| from the left and from the right, where p and p' are two arbitrary momentum 
states of our system. This gives us 

{^\d,p\f') = {p\^[pP^-P^p)\p'). (4.22) 

Since p and p' are states of definite momentum, they are eigenvalues of P. Hence, P |p) = p |p), 
{p| P = (p| P/ and likewise for p'. So, our equation of motion becomes 

<p|<9,p|p') = (p| ^ (pp2 - p2p) |p') 

= ^(P\PPP'\P')-^{P\PPP\P') 
= ^(P\PP'\P')P' - ^(P\PP\P')P 



'-{P\P\P'){P"-P') 



2m 
i 

2m 



J-{p\p\p'){p' -p){p' +p). (4.23) 



Next, we substitute difference and mean variables, A and p, for p and p' by defining A = p - p' and 
2p = p + p'. This substitution is algebraically equivalent to p = p + A/2 and p' = p - A/2, so 

<p + A/2| dtp \p - A/2) = ^{P + A/2| p\p- A/2) 2pA. (4.24) 



4. The Wigner Distribution 61 

We multiply both sides of the equation by dA ■ e~"^'^/(27T) (the kernel of the Fourier transform) and 
integrate from A - -oo to A = +00. The left hand side is 

LHS = JdA-^e-'^^ip + A/lldtplp-A/l) 

= J dA-<9f^e-'^^-<p + A/2|p|p-A/2) 

= dt^JdA- e-'^' {p + A/2| p\p- A/2) 
= d,Wp{x,p,t) 

^ dtW{x,p,t), (4.25) 

where we use the fact that the only explicitly time dependent piece of the integrand is p. We 
also assume that the integral converges and, in the last step, we use eqn. 4.13 for the Wigner 
distribution in the momentum basis. Proceeding in a similar fashion on the right hand side, we get 

RHS = J dA- :^e-'^''^{p + A/2\p\p- A/2)2pA 

= ^/^'^•(r "i<P + ^V2|p|p-A/2) 

= -^^/'^'^■H"'"1<P + ^/2|p|p-A/2) 

= -mtf dA-d,(e-''^){p + A/2\p\p-A/2} 

= -^^^^J dA-e-'^^p + AI2\p\p-AI2) 

= -^d,Wp{x,p,t) 
m 

^ -^d^W{x,p,t), (4.26) 

where we use the fact that e~''^^ was the only factor in the integrand that explicitly depended on x. 
We again assume that the integral converges and use eqn. 4.13 for the Wigner distribution in the 
momentum basis. Thus, equating the right hand and left hand sides in the position representation 
leaves [24] 




which is the equation of motion for a free system in terms of its Wigner distribution. 
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Although it might seem convoluted to introduce the Wigner form of this equation rather than 
using the evolution of a free particle in terms of its state operator, the power of the Wigner 
distribution is that it allows us to treat position and momentum simultaneously. 

4.4 Associated Transform and Inversion Properties 

Now that we have determined some of the properties of the Wigner distribution, it is useful to 
define the Wigner transform of an arbitrary distribution of two variables. 



Definition 4.2 (The Wigner transform). Let D{x, y) be an arbitrary distribution of two 
variables, x and y, and possibly have an implicit temporal dependence. Then, the 
Wigner transform 'W of D, a special case of the Fourier transform, is defined as 

(D(x, y)) s i- J d6 . .e-v^D{x, y), (4.28) 

where 6 = x - y, as identified in definition 4.1. 



By definition, we know the Wigner transform of p{x, y) immediately. It is 

^ W' y)) = ^ / ■ ^"''VC^' y) = W p). (4.29) 

We arrive at a more interesting result by considering 

'W(dtp{x,y)) = J d6-e-'P^dtp{x,y). (4.30) 

Clearly, neither 5 nor e~'P^ depend explicitly on time. Assuming that the integral converges, we 
have 

J db ■ e-'P^dtp{x, y) = dt ^ db- e-'J'^p{x, y) = dtW{x, p), (4.31) 
where we have applied definition 4.2. So, 

^idtp{x,y))^dtW{x,p), (4.32) 

as desired. 
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In the following sections, we work out some of the Wigner transforms of functions that we will 
need later. The results of these derivations are summarized in table 4.1 below. 



Table 4.1: Wigner transforms of important quantities, where x = ^ and 6 = x - y. 



Expression 


Transform 


p(x,y) 


W(x,p) 


dtp{x,y) 


<?tW(x,p) 


i(dl-dl)p{x,y) 




(x-y) [d^-dy) p{x,y) 


-2<9p(p-W(x,p)) 


{x-yfp{x,y) 





4.4.1 The Wigner transform of (z/2) {d]. - p{x, y) 
By definition 4.2, 

y s (dl - p(x, y)) = Jd6 ■ e-'P^^ (^^ _ ^2 j ^(^^ (433) 

where we have defined V for our convenience. Then, we know from Clairaut's theorem that since 
all partial derivatives of the state operator are continuous, [(9xp(^/ y), dyp{x, y)j = [25]. V then 
expands to 

y = ^ J db- e-'v' (dl -d] + d,dy - dyd,) p{x, y). (4.34) 

We next note that by definition 4.1, x = x + 1/2 ■ 6 and y = x - 1/2 ■ 6, which impHes d^x = 1/2 and 
d^y = -1/2. Hence, 2(?gx = 1 and 2d{,y = -1. We rework V to be 

V = ^ ■ 2 J d6 ■ e-'''* {{d,x) dl + (d^y) d\ + {d,x) d,dy + {d,y) dyd,) p{x, y) 

■ [as -,pbil ^P(^'y)I^A ^^ . l ^P'y^'y)l^y \dy , i dp{x,y)/dy \dx ^ ( dp{x,y)/dx \dy) 
" 'j^^'' \\ dx ]db \ dy ]db \ dx ]db \ dy jdb) 

= ij db-e~'''^di,(d^p{x,y) + dyp{x,y)y 



(4.35) 



Now, by definition 4.1, 5jX = d^y = 1, so 



o»p(x,y) dp{x,y)dx dp{x,y)dy dp{x,y) dp{x,y) ^ , / ^ o.^ 

+ — ^ — = — ^ + — ^ = <7.yP(x, y) + dyp{x, y). (4.36) 



dx 



dx dx dy dx 



dx 



dy 
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Thus, we have 

Next, we integrate by parts to get 



V = (e-'P'd,p{x, y)) -ij dd- {d,e-'P') d,p{x, y). 



Noting that the state operator is continuous in the position basis, we find 



lim dxp{x,y) 



= lim d^p ix + \b,x - ifi] 
= d<c lim p{x+\b,x-\b\ 
= lim p{\b,-\b\ 



= 0. 
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(4.37) 



(4.38) 



(4.39) 



Further, 



so 



Hence, 



< le-'P'^l < 1V6, 



(e-'v'd,p{x,y))\ =Q. 



(4.40) 



(4.41) 



y = -f J db- (^se-'P*) d,p{x, y) = -i J db ■ (-ipe-'P') d,p{x, y) = -pd, j db ■ e-'^'p{x, y). (4.42) 



That is, by definition 4.1, 



^ - dfi p{x, y)) = -pd, j db ■ e-'P'pix, y) = -pd,W (x, p) , (4.43) 



which is what we wanted to show. 
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4.4.2 The Wigner transform of {x - y) [dx - p{x, y) 
By definition 4.2, 

V = ^({x-y) (d, - <9y) p{x, y)) = J d6 ■ e-'P'\x - y) (d, - dy) p{x, y), 

where we have again defined V for our convenience. Since b = x - y, we have 

y = J db-be-'v'(d,-dy)p{x,y). 

As we did in the previous section, we note that di,x = 1/2 and di^y — -1/2, so 2di,x 
2d{,y = -1. Thus, 

y = 2 rrf6.6e-'^'^f^| + ^|). 
Next, we use the chain rule to find 

y = ^J db-be-'P^dt,p{x,y). 

After that, we use integration by parts to get 

y = 2(6e-'PV(^.y))|" -2j^ db- p{x,y)di,[be-'P^). 

As before, we investigate the boundary term. The non-oscillatory component follows 



lim bp{x, y) = 0, 

6— >±oo 



since p{x, y) goes to zero rapidly off the diagonal, as we noted in section 2.6. Since 

< |e-'P*| < 1 V6, 



we have 



so 



lim e-'>*6p(x,y) = 0, 

6— >±DO 



y = -2 ^ db- p{x, y)di, [be-'P^) . 
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Finally, note that 

d,(6e-P") = dp(pe-'P'), (4.53) 

hence 

V = -2 J db- p{x, y)dp (pe-'P') = -2dp {pjdd- e-'P'p{x, y) J = -2dp (p ■ W{x, p)) . (4.54) 



That is. 



W ((X - y) {d, - dy) p{x, y)) = -2dp {p ■ W{x, p)) , (4.55) 

as desired. 

4.4.3 The Wigner transform of (x - yf p{x, y) 

By definition 4.2, 

V^^({x- yf p(x, y)) = ^db- e-'P^ (x - yf p(x, y), (4.56) 

where, as in the past two sections, we have defined V for our convenience. By definition 4.1, 

6 = X - y, so 

V = ^ db-e-'P^d^p{x,y) = J db ■ b^-'P^ p{x,y). (4.57) 

Now, since 

62g-/p6 ^ _ J;2g2j g-,p6 ^ ^dle-'P\ (4.58) 



we have 



V = Jdb- (-dyP^) p{x, y) = -dj J db ■ e-'P^p{x, y) = -dlW{x,p), (4.59) 



or 




which is what we wanted to show. 
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4.5 Example: The Wigner Distribution of a Harmonic Oscillator 



We will next develop the Wigner distribution for the quantum harmonic oscillator. The Hamiltonian 
is [2] 

H = — + -kx^, (4.61) 
where k is the spring constant, and the angular frequency is 

cv=J-. (4.62) 
\ m 

From the time-independent Schrodinger equation, eqn. 3.77, we have 

- ^d',4,{x) + ^kx^ipix) = Eipix). (4.63) 

This equation is readily solved using power series, and has the well-known family of solutions 
[4, 2, 13] 

which correspond to states of constant energy [2] 



ln = {n + ^a,. (4.65) 



For the purposes of this example, we will concentrate on the ground state {n = 0) and the first three 
excited states, shown in figure 4.1. In order to calculate the Wigner distribution of these states, we 
must use eqn. 4.2, so we need explicit forms for the matrix elements of the state operators in the 
position basis. Fortunately, since the harmonic oscillator is pure, we easily obtain these by 

Pn{x, y) = {X\ pn \y) = {x\lpn) {4'n\y) = lp'„{x)4' niy) , (4.66) 

where we used eqn. 3.58 to identify the wavefunction ^„{y) and its complex conjugate ^*^{x). Since 
ip{x) is real we have. 



Pn{x, y) = -\—) (^) ('"'"^ - '^.v) - dy) e^^e^y . (4.67) 




Figure 4.1: The first four energy states, ip„{x), of the harmonic oscillator. 
Particularly, for n = through n - 3,m units where m - co = fi - 1, we have 

1 _xW 

po(x,y) = —e 2 , 
1 

pi(x,y) = 2xy— e 2 , 

P2(x, y) = J {-2e-'' + Ae-'\^] (-2e-"' + 4e-y'y^) ^e^, 
8 ^ ' ^ yri 



(4.68) 



which we plot in figure 4.2. Now that we have the general form of the state operator matrix 
elements, it is just a matter of evaluating eqn. 4.2 to get the corresponding Wigner distributions. 
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n-0 





n - I 



ti = 2 



n 3 



Figure 4.2: The position representation of the state operator, p„(x, y), for the first four energy states of the 
harmonic oscillator. In the density plots, yellow indicates maximum values, while blue indicates minimum. 

Starting with n = 0, we have 



1 /a;;«\i/2 r 



d6 •e"'P'^e"i'^™(-^"2'^)' 



(4.69) 



which is just a three-dimensional Gaussian distribution. The calculations involved for the excited 
states are similar, but the algebra is significantly less trivial. They are easily performed using a 
computer algebra system, so we state the result. The Wigner distributions are 



Wo(x,p) 
Wi(x,p) 



2p^ + 2x^-1 
n 

.4 _L o^i . 



2p* + 2x* + 4p-'x^ - 4p^ -ix-' + l _„,_,z 

e " 

n 

4x'5 + 12^2^4 _ i^yA ^ I2p4x2 _ 35p2jf2 + ^^^1 + 4^6 _ igp4 ^ -^g^l _ 3 

3^ ' 



(4.70) 



-p -X 



which are plotted in figure 4.3. Note how Wq > for all values of x and p, but the higher energy 
states are sometimes negative. As we mentioned briefly before, the Wigner distribution is 
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Figure 4.3: The Wigner distribution, W„{x,p), of the first four energy states of the harmonic oscillator with 
their well-known shape [19]. In the density plots, yellow indicates maximum values, while blue indicates 
minimum. 

motivated by classical phase-space probability distributions, but is permitted to have negative 
values. These "negative probabilities" are a weird signature of a quantiun mechanical system. To 
make this analogy more concrete, we consider Wio(x, p), shown in figure 4.4. At the high energy 
of n = 10, the oscillations inside the Wigner distribution become increasingly rapid. In the 
classical limit, as n ^ oo, we expect the negative portions to overlap and cancel with the positive 
components, giving us a positive-definite, classical probability distribution. In order to force 
this for n = 10, we perform a careful function smoothing, known as a convolution, of Wip with a 
simple Gaussian. Mathematically, this is [26] 

WJo(X,P)s J dxdp- Wio(x,p)e-<^-*)'-(^-'')'. (4.71) 

As shown in figure 4.4, this averages the inner oscillations to zero, but retains a large, outer, 
positive ring. This is what we expect, since a classical simple harmonic oscillator has eliptical 
orbits in phase-space. 
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Figure 4.4: The position representation of the state operator, piO{x, y), at n = 10, its associated Wigner 
distribution, Wio(x,p), and the smoothed Wigner distribution generated by convolution with a Gaussian, 
WJ„(X,P). 



CHAPTER \^ 

The Master Equation for Quantum Brownian Motion 



IN this chapter, we develop the fundamental equation of quantum decoherence, the master 
equation for quantum brownian motion. The master equation dictates the time-evolution of a 
system and an environment with which the system interacts (these terms will be precisely defined 
later). To facilitate this, we use the formalism of the Wigner distribution developed in chapter 4, 
since it incorporates both position and momentum simultaneously, and consider how the system's 
Wigner distribution changes with time. Then, we invert the Wigner transformation to get the 
master equation, written in terms of the the system's state operator. After the equation is developed 
in this chapter, we examine its physical meaning and work through an example in chapter 6. 

5.1 The System and Environment 

The idea of collisions between a system and environment can be represented intuitively in a 
physical picture, as shown in figure 5.1. However, before we begin, we need to define precisely the 
notion of system and environment. Further, we need to specify what we mean by an interaction or 
collision between the system and environment. 



Definition 5.1 (System). A system, or system particle, denoted as S, is a single, 
one-dimensional point particle. It has momentum ps, mass nzg, and position Xg. 
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Collision 







Environment 









Figure 5.1: A graphic representation of the system and environment. Note that one of the environment 
particles is undergoing a collision with the system. For simplicity we consider the corresponding 
one-dimensional problem. 



Definition 5.2 (Environment). An environment of a system S is denoted £s/ and 
consists of an ideal one-dimensional gas of light particles. Each of these particles has 
momentum pE, mass me, and position xe- We will often abbreviate £s to £ if it is clear to 
what system S the environment belongs. 



Definition 5.3 (Collision). A collision between a particle of an environment £ and a 
system S is defined as an instantaneous transfer of momentum that conserves both 
kinetic energy and momentum. 



It is important to note that the system we are considering is very large (massive) when 
compared to the individual environmental particles. Precisely, we take [22] 



and we will typically neglect terms of second or higher order in this factor. 

Now that we have defined the key objects treated by the master equation, we begin to 




(5.1) 
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investigate its structure. As stated above, we first want to consider how the Wigner Distribution of 
the system, Wg, changes with time. Quantum mechanically, we separate this change into two 
pieces. First, Wg undergoes standard unitary time evolution, with the system treated as a free 
particle. Second, S collides with environment particles, and the collisions alter the system's energy 
and momentum. In section 4.3, we considered the change in the Wigner distribution of a particle 
due to its free evolution, which we will make use of later Now, we begin to consider the influence 
of an environment on a system. 



5.2 Collisions Between Systems and Environment Particles 

Before we begin to examine how a system behaves in the presence of an environment, we first 
consider the collision between a system particle and one particle from an environment. For 
each collision, we derive equations for momentum and position change. First, we address 
momentum change. 

Let ps and pE denote the initial momenta of a system and an environment particle. By 
definition 5.3, the interaction between the two particles is totally elastic. That is, both kinetic 
energy and momentum are conserved. We write kinetic energy conservation as [9] 

zL + A^zL + iL (5 2) 

2ms ^niE 2ms "^^ie' 

which is equivalent to 

ms {pE - Pe) (pe + Pe) = -niE (ps - ps) (ps + ps) , (5.3) 
and momentum conservation as [9] 

Ps +Pe=Ps+ pE, (5.4) 

which is also written as 

(Pe - Pe) = - (ps - Ps) • (5.5) 
We then assume that, since a collision occurred, the momenta of both the system and environment 
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particle have changed, i.e. pE - pE and ps -ps 0. So, we divide eqn. 5.3 by eqn. 5.5 to get 

ms{pE-pE){pE+pE) -mE(ps-ps){ps+ps) 

7 ^1 = 7 ' (^-6 

(Pe - Pe) - {ps - Ps) 

which implies 

ms (Pe + Pe) = m (ps + ps) ■ (5.7) 
Then, we solve eqns. 5.5 and 5.7 simultaneously for both ps and pE- We have [9] 

ms(pE+pE) = mEips+Ps+PE-pE) ^ -{nis -mE)pE+2mEps = {mE + ms)pE 

_ nis-niE , 2ote ,j. ^. 

=^ Pe = ; Pe + ■ Ps (5.8 

nis+niE nis +mE 

and 

ms (pe +Pe+Ps- ps) = mE (ps + ps) {ms - mE)ps + 2mspE = {ms + mE)ps 

. ms-niE , 2ms ,r- a\ 

^ Ps = ■ Ps + ■ Pe, (5.9 

nis + ms + wi£ 

which are the changes in the momenta of the environment particle and the system. 

Now that we have investigated the momentum change that results from a collision, we will 
develop the corresponding position change. To do this, we use the plane wave treatment for the 
total system we developed in section 3.4 and note how changes in momentum imply changes in 
position. 

The wavefimction of the composite system containing both the system and environment 
particle, a product state, is given by^ 

<p = (ps(pE- (5.10) 

Using equation 3.89, we can form the composite plane wave, (pi, from the individual incident plane 
wave of each particle.^ This is 



^Since the product state vector is \<p^ = \(ps'^ ® I'Pe), the wavefunction form of the composite 
state takes ordinary multiplication. 



^Remember that plane waves are states of definite momentum. We are using them in this case 
because we are conserving the momentum in the collision. 
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After collision, using the momentum representation, the plane wave, (pf, becomes 

cPf = eV''''e'P''"^. (5.12) 

By eqns. 5.8 and 5.9, this can be written as 



f-r. r > \ (./ms-niE 2ms \ \ (.( nis-niE Im^ \ \ 
Exponent of c^/ = \i\ ; ps + ; pe ks + H i Pe + \ Ps P^e 

Jms-niE 2ms ms - mE 2ote \ 

= I Ps^s "I Pe^s Pe^e h Ps^E 

Vjws + ote ms+mE ms + mE ots + wie / 

/. Ims-mE 2wiE \\ /. / 2ms ms - OTe \\ 

= \ips\ Xs H Xe + UPe Xs Xe ■ 

\ \ms+mE ms+mE // V Vots+ote ms+mE 11 



We define 



where [22] 



and 



Xs = Xs H Xe (5.14) 

ms + niE ms + niE 



2ms ms -mE ^, 
Xe = Xs Xe. (5.15) 

ms + twe ms + mE 

This way, we now have position and momentum representations of the collision. As is common 
in physics, we need to require that these collision interactions are local. ^ Thus, throughout the 
collision, we take 

|xs -xeI «c |xsl (5.16) 

and 

|xs -xeI « |xe|, (5.17) 
since the potential energy, V{xs - xe) as \xs - xe\ — > oo. Recalling that from eqn. 5.1 

— « 1, (5.18) 

ms 



^It is important to emphasize that locality is not an approximation, but is necessary to include 
in our treatment. Ideally, we would work this into our equations formally. However, for simplicity, 
we can achieve local interactions by requiring this condition. 
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it is reasonable to ignore contributions to distances of order 



— {xs-xe)<^xs,xe. (5.19) 
ms 



Enforcing the locality of collision, we find 



Ms -niE IniE 

xs = , xs + , Xe 

ms + mE ms + me 



me 2mE \ Ime 

]xs + Xe 

ms + mE ms + mEi ms + me 



I ms+niE _ 2mE \ 
\ms + mE ms + niEi 

V ms + mEi 



Ime \ Im-E 

— \xs + , Xe 

ms + ME/ ms + niE 

IviE IviE 

Xs Xs + Xe 

nis +mE ms + me 

Xs + , (Xe - Xs) 

ms + mE 

xs + 2^ (XE -Xs)-2 (^f {XE - xs) + ... 

ms \msi 

Xs (5.20) 



and 



2ms ms - mE 

Xe = ; Xs ; Xe 

ms + jwe ms+ mE 

(2ms+2mE 2mE \ I 2mE ms+mE\ 

' - ■ \xe 



12ms + 2mE 2mE \ I 
I ms + mE ms + mEi 1? 



ms + mE ms + mE I Vms + me ms + mE/ 
V ms+mE I \ms + mE I 

= 2.s-XE + ^iXE-Xs) 

ms + mE 

= 2xs -XE+2— {xe -xs)-2 (^f {xe - xs) + ... 

ms \msl 

~ 2XS-XE, (5.21) 



which amounts to a phase shift in our plane wave state. We have now worked out all the 
position and momentum components we will need to treat the full case of a system coupled to an 
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environment. In summary, we have 



ms - niE 2ms 

Ps = Ps "I Pe, 

Ms +niE ms + niE 

ms -mE 2wie 

Pe = Pe -i Ps/ 

ms +mE ms + mE 

xs ~ Xs, 

xe ~ 2xs-xe. (5.22) 



5.3 Effect of Collision on a Wigner Distribution 

In this section, we consider the change in the Wigner distribution of the system, Wg, from one 
collision with an environment particle. Since we have a composite state of environment and system 
particle, we use equation 4.16 to write the Wigner distribution for the system and environment as 

Ws^E = WsWe. (5.23) 

It follows that the change in the total Wigner distribution for the system and environment, AWs+e, 
due to one collision is [22] 

AWs+£ = Ws+E - Ws+E 

= WsWe - WsWe 

= Ws (xs,ps) We (xe,Pe) - Ws{xs,PsWe{xe,Pe)- (5.24) 

Now that we have the change in the total (system and environment) Wigner distribution, we use 
eqn. 4.17 developed in section 4.2 to deduce AW, the change in the system's Wigner distribution, 
by summing (integrating) over all environmental configurations. We have [4] 

AW = :?I(AWs+e) 

= J*....^W.fe,/>s)W.fe,p.)-/*,<i«H.Wsfe.p.)lV,ta,pri. (5.25) 
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To evaluate these integrals, we need to perform some algebraic manipulations on the first term in 
eqn. 5.25. From the eqn. 5.22, we know that the first term of eqn. 5.25 is (approximately) given by 

dpE^^E • Ws xs, ps + Pe We 2xs - ^e, Pe + Ps • (5.26) 

J \ nis + Me nis + Me / \ ms+niE nis + niE / 

We make the substitution 



s 2xs - xe 

ms-niE 
nis + Me 



Pe + 



2mE 
nis + niE 



(5.27) 



from which it follows that 



dxE- - -du 

nis+niE 

dpE = 

ms - Me 

Further, since 



dv. (5.28) 



Pe = { ; Ps-iy], 5.29 

\ms - VlE I \ni$ + THE I 

we have 

m$-mE 2ms ms-rriE 2ms (ms+mE\( 2mE \ 

Ps + Pe = Ps + Ps-^] 

ms+mE ms+mE ms+mE ms + mE \ms - mE/ \ms + mE I 

ms - mE AniEms 2ms 

ms+niE^^ {ms-mE){mE + ms)^^ ms-ms^ 

mE + ms 2ms 
= Ps V 

ms - mE ms - mE 

= Ps + ^^ —■ (5.30) 

ms - ms 

Thus, substituting eqns. 5.27, 5.28, and 5.30 into eqn. 5.26 gives 

r dvduWs lxs,ps + '^^^P^-^^-A We («,.) . (5.31) 
ms-ME J \ nis - Me ) 

Next, we make the substitution u = xe and v = pE, so eqn. 5.26 becomes 

r dpE^XE . Ws lxs,ps + We (xe,Pe) ■ (5.32) 

ms-mE J \ ms-mE I 
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Now, eqn. 5.25 is 

AW = ^ r rfperfxE ■ Ws (xs,ps + ^ ^""^^^ ~ '"'^^^ ] We (xe^Pe) (5.33) 
nis-niE J \ ms-niE I 

- ldpEdXE-WsiXs,ps)m^E,p.) 

= f dpEdxE ■ ( ^^^'"" Ws (xs,ps + ^iai^PlZ^!!^] _ Ws(xs,ps)) We (xe,pe) • 
J \ms-mE \ nis-niE I j 



I , 2{mEPs-msPE)\ ,_ „ 

Ws xs, ps + (5.34) 



Next, we expand 

/ 2(f 

ms-niE 

using a Taylor series expansion in momentum about p = ps- This is 

/ 2{mEPs-msPE)\ . 2 (wzePs - "JsPe) <?Ws 

Ws xs,ps + = Ws(xs,ps) + 5 — (xs,ps) 

\ ms-niE I nts - Me dps 

l( 2{mEPs-mspE) \^d^Ws^ , 

+ o -r^(^s,Ps +••• (5.35 

2 \ ms-niE I dpi 

In order to justify dropping the high-order terms of the expansion, we need to show that 

2 (otePs - mspE) ^^11 ,^ „ 

« \ps\ , (5.36) 

ms -me ' ' 

which is not readily apparent. If we expand the term in OTe/ots/ we have 

2{mEPs-msPE) ^ ^'«E x/'^Ef , /ra-TA 

= -2pE + 2 (ps - Pe) — +2 (ps - Pe) — + - 5.37 

nis - niE ms Vots / 

Recalling eqn. 5.1, it is obvious that while the terms of first order and higher in ntE/nts are small 
compared to ps, the first term, -2pE, is not necessarily small with respect ps- Fortunately, since we 
are expanding in an integrand and the average value of pE is zero, we can neglect this term and so 
we are justified in dropping high order terms in our Taylor expansion.^ Simplifying coefficients 
and dropping terms of third order or higher, eqn. 5.35 is approximately 



2 (otePs - otsPe) dWs , , { 2'«sPe " 4oteOTsPePs + 2OT2p2 ^ ^2yy, 



Ws(xs,ps) + 5 — (xs,ps) + 



ms - niE dps ' [ {niE- ms) 



s2 



dpi 



'(xs,ps). (5.38) 



^The fact that the average value of pE is zero is dealt with explicitly in eqn. 5.59. 
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Hence, we write AW as [22] 



AW ~ J dpEdxE-{A'Ws{xs,psWE (xe,Pe) + Bdp,'Ws{xs,psWE {xe,Ve) + C*?^, Ws(xs,Ps)We {xe,Pe)) , 

(5.39) 

for some A, B, and C. We now work out the values of these coefficients, starting with A, which is 



^ ^ ms + niE _^ 
nis - niE 
2mE 

Ms-Me 

2mE ^ 1/ms 
ms-niE lints 
IviE 1 



ms l-iriE/ms 

2mE _|_ ^ _|_ _^ 
ms \ ms [msJ 

2 



ms \ms 

~ 2—, (5.40) 

ms 

where we used the approximation in eqn. 5.1 to neglect the terms of order two or higher in mE/ms- 
We now turn to B, given by 

^^/ ms+mE \ 2 (otePs - w^sPe) 
[ms-mE/ ms-mE ' 

Anticipating a series expansion, we change variables to r = mE/ms so that 7«e = rms- B is then 

(ms + rms\2{rmsps - msPE) (1 + r\2{rps - Pe) 



B = 



\ Z{rmsps-mspE) _ ( 1 + r \ l{rps - Pe) 
ms-rmsl ms-rms \l-r) 1-r 



We also calculate the first and second derivatives of B with respect to r. They are 

dB _ 2pe(3 - r) - 2{ps + 3psr) 
dr~ (r - 1)3 

and 



(5.43) 



d^B _ 4pE(5 + r)-12ps(l+r) 

dr2 " (r-l)4 ■ ^^-^^^ 
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Taking the Taylor series expansion of B in r about r - 0, we find 



I dB 



d^B 

1-0 dr-^ 



= -2pE + {2ps - 6pe) ■ r + (12ps - 20pe) ' y + - 



= -2pE + (2ps 
~ -2pE + {2ps 



,,).^+(6ps-10p.)-(^) +, 
OTs \ms/ 



Me 
ms 



= 2p; 



niE 
ms 



2 + 6 — )pe, 



JHe' 



(5.45) 



where we used eqn. 5.1 to neglect the terms of order two or higher in wie/ots. Finally, we consider 
the coefficient C, given by 



C = 



OTs + ote \ 2?n^p^ - imEMspEPs + 27n^p^ 



/ ms+mE \ 



(5.46) 



In the same way we worked out coefficient B, we make the substitution Me - rms, which gives us 



C 



1 +r \ 2pl-ArpEps +2r 



1 + r\ 



,2„2 



(r-1)^ 



and 



4 (p2 (2 + r) + p2r(l + 2r) - pEps (l + 4r + r^)) 
d7 " (r - 1)4 ' 

^2q 4 (2pEPs (4 + 7r + r^) - 3p|(3 + r) - ^1 + 7r + 4r2)) 
d^ " 



(r-l)5 

When we take the Taylor series expansion of C in r about r - 0, we have 



(5.47) 



(5.48) 



(5.49) 



C = C 



r=0 



dCi d^Ci 
+ — ■r+ — - • TT + - 

ar l/-=o ar^ li-o 2 



= 2p| + (Spl - 4pEPs) ■ + (36p| - 32pEPs + ^p's) ' y + - 

= 2p| + (Spl - 4pEPs) ■ ^ + (l8p| - 16PEPS + 2p2) ■ (^) 

~ 2p| + (8p2,-4pEPs)-^ 

= (2 + 8|g)pE-4PEPs 



OTe 



(5.50) 



5. The Master Equation for Quantum Brownian Motion 



83 



Thus, using eqn. 5.39, we can write AW as 

AW ~ X + y + Z, (5.51) 

where 

X = (2^) J dpEdxE ■ We{xe,Pe)Ws{xs,Ps), (5.52) 

Y = (2ps^) J dpEdxE-WE{xE,pE)dps'Ws{xs,ps)-{2 + 6^^ J dpEdXE-pEWE{XE,PE)dpsWs{Xs,Ps), 

(5.53) 

and 

Z = (2 + 8^) J dpEdXE-plWE{XE,pE)d%Ws{Xs,ps)-Aps'^^ J dpEdXE-pEWE{XE,PE)d%Ws{Xs,ps)- 

(5.54) 

Now, we recall from our preliminary discussion on the marginal distributions of the Wigner 
distribution in section 4.1.3 that 

J dpEdxE- 0{pe)We{xe,Pe)Ws{xs,Ps) = Ws{xs,ps) JdpE-0{pE) J^dxe ■ We(xe,Pe) 

= Ws{Xs,ps) J dpE-0{pE)p(pE,PE) 

= Ws(xs,ps)Tr(6p) 

= Ws{xs,ps){d), (5.55) 
where O is an observable. Hence, our previous calculations yield 

X = (2^)<1) Ws(xs,ps) = 2^Ws(xs,ps), (5.56) 
\ ms I nis 

Y = (2ps—)dp^Ws{xs,ps) - (2 + 6^)(pe)^psWs(xs,Ps), (5.57) 

and 

Z = (2 + 8 ^ ) {p2 ) Ws(xs, ps) - Aps ^ <Pe) d% Wsixs, ps). (5.58) 

However, originally we considered the environment as an ideal (one-dimensional) gas of 
environment particles, so it is reasonable to assume that any measurement of an environment 
particle momentum is equally likely to be in the opposite direction, i.e. (pe) = 0. Eqn. 5.51 then 
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becomes 



AW 



2^Ws(xs,ps) + (2ps^)^PsWs(xs,ps) + (2 + 8^)(p|)^2^Ws(xs,ps). (5.59) 



We notice that 



2|gWs(xs,ps) + (2ps^)^psWs(xs,ps) = 2'^^(Ws{xs,ps) + PsdpsWs{xs,ps)) 



(5.60) 



so we write the change in the Wigner distribution of the system due to one environmental collision 
as 



5.4 The Master Equation for Quantum Brownian Motion 

In our simple model, the system is only under the influence of environmental particles, and is free 
otherwise. Thus, the total change in the system's Wigner distribution with time is given by its free 
particle term added to some contribution due to the environment. Since the environment acts on 
the system through collisions, if we define T to be the statistical number of collisions per unit time 
between the system and environmental particles, we combine eqns. 4.27 and 5.61 to get [22] 



- = -^<9.sW(xs,ps,f) + r(2^<?p,(psWs(xs,ps)) + (2 + 8^)(p|)5^^..Ws(xs,ps)), (5.62) 



an expression for the total change in the system's Wigner distribution with time. We use table 4.1 
to convert our equation for the Wigner distribution to an equation for the state operator of the 
system. This is 




(5.61) 




(5.63) 
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Noting that this is true for all ps{x, y), we have 



dtps{x, y) = ^ (^l - °'y) Ps(x, y)-r^(x-y) [d^ - d,^ ps{x, y)-T (2 + 8^) (pi) (x - yf ps{x, y). 

(5.64) 

We take the standard definition for the dissipation rate y to be [22] 



y = — r, (5.65) 

ms 



so 



dtps{x, y) = ^ (^l - ^y) ps{^, y)-y{x-y) (dx - dy) ps{x, y)-y^ ^5) ^'^^^ ~ ^''^ 

(5.66) 

To express this result in standard form, we use the definition for temperature in one dimension 
from statistical mechanics, which is [27] 

where T is temperature and k is the Boltzmann constant. Using this definition, we examine the last 
term more closely and find 

= y{2ms+8mE)kT(x-yf ps{x,y) 

~ ylmskT {x - yf ps(x, y), (5.68) 



where we have used the fact that me <^ ms- Thus, our final result is [22] 



dtpsix, y) = ^ i^l - ^y) Ps{x, y) - y{x - y) [dx - dy) ps{x, y) - InisykT {x - yf ps{x, y), (5.69) 

which is the accepted master equation for quantum Brownian motion [3, 12, 22]. Using dimensional 
analysis, we can reinsert % to bring the master equation into SI units. This is 
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dtpsix, y) = [dl - 5y) ps{x, y) - y{x - y) (^.v - d,^ ps{x, y) - -^ykT (x - yf ps{x, y). (5.70) 
The assumptions used to derive this equation are listed in table 5.1. 



Table 5.1: Assumptions used for the derivation of eqn. 5.69 



Assumption 


Equation 


Label 


Small mass ratio 


wze/ots «; 1 


5.1 


Locality 


\xs-xe\ « \xs\ 


5.16 


Statistical environment 


(pe) = 


5.59 


Dissipation 


y = niE/ms T 


5.65 


Temperature 


l/2-kT = {pl)/{2mE) 


5.67 



CHAPTER 

Consequences of the Master Equation 



WE now explore the physical ramifications of the master equation for quantum Brownian 
motion, developed in the previous chapter. First, we investigate its physical meaning term 
by term. Next, we consider the simple example of a quantum harmonic oscillator undergoing 
decoherence. Finally, we offer some closing remarks on decoherence theory in general and 
suggestions for further reading. 

6.1 Physical Significance of the first two terms 

In the realm of master equations, eqn. 5.69 for quantum Brownian motion actually is simple [12]. 
Even so, the purpose of each term is not immediately obvious. In this section, we examine the 
physical meaning of the first and second terms. The first term is the free system evolution, as it is 
the transform of eqn. 4.27. It does not hurt to verify this explicitly, without employing the Wigner 
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distribution. If we switch to SI units via eqn. 5.70, the first term is 

tfi I tfi I 

By eqn. 3.20, the free system (for which V = 0) has a Hamiltonian of 

= (6.2) 

so our equation becomes 

- {x\ (Hfps - PsHf) \y) = {x\ [ps, Hf\ \y) , (6.3) 

which is 

{x\dtpc,\y) = dtps{x,\j) (6.4) 

by eqn. 3.55. Thus, we confirm that the first term in the master equation is the free evolution of the 
state operator. 

The second term is not so obvious, and turns out to be responsible for damping our system's 
motion. To explain this, we use the master equation to calculate the rate of change of the 
expectation value of momentum due to the second term. In the position basis, the second term 
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reduces to [3] 



- Tr(Mp), 



-yTr (P^y(x - y) (d^ - dy) p{x, y)) 

-yTr ^y(9.v [y{x - y) [d^ - d,^ p{x, y)] j 

-yTr Q {d^ - dy) p{x, y) j - yTr Q(x - y) [d^ - d^dy) p{x, y)j 

-y ^ dx • ^ - (9y j p(x, x) + y ^rfx ■ -!-(x - x) - 5.^5^^ p(x,x) 

r 1 

-y I dx ■ - - 0) p(x, x) + 

-7Tr(y(9,p(x,y)) 
-yTr(Pp) 

-y{P), (6.5) 



which is 



dt{P\ = -y{P). (6.6) 

Hence, the contribution to the rate of change of momentum of the second term is the dissipation (a 
scalar) times the momentum, pointed in the opposite direction as the momentum. This is precisely 
a damping effect, which is what we wanted to show [14]. 

6.2 The Decoherence Term 

The last term of the master equation turns out to cause decoherence of the system, so it is central to 
our discussion. To interpret it, we will make some reasonable approximations. To get a better idea 
of the relative size of the terms, we use the SI version of the master equation, eqn. 5.70. Notice that 
the last term contains a numerical factor of ^ 10^^, while the other terms are either first or 



6. Consequences of the Master Equation 90 

zeroth order in 1 /ft. Thus, we surmise that for sufficiently large |x - y\, the last term will dominate 
equation.^ Hence, our drastically simplified master equation is [3] 



InisykT 2 

dtpsix, y) — (x - y) ps(x, y), (6.7) 



which has the standard solution 

ps{x,y,t) = ps{x,y,0)e-^^'-y> (6.8) 



Since the argument of the exponential must be dimensionless, — has units of time. 

^ 2msykT{x-y) 

Customarily, we identify [3] 



fi2 

td = ^ (6.9) 

ImsykT {x - y) 



as the (characteristic) decoherence time of the system, which is its e-folding time.^ Notice also that 
the decoherence time varies with location in state-space, as it depends on both x and y. Thus, we 
are not surprised to find that some regions decay faster than others. Further, since %^ ~ 10~^^ in SI 
units, the decoherence time for any reasonably large system is incredibly small.'^ Next, we consider 
an example to show how decoherence operates on a simple situation. 
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Figure 6.1: The decoherence of the ground state of the quantum harmonic oscillator under the simplified 
master equation 6.11. In the density plots, yellow indicates maximum values, while blue indicates minimum. 




Figure 6.2: The decoherence of the third excited state of the simple harmonic oscillator under the simplified 
master equation 6.11. In the density plots, yellow indicates maximum values, while blue indicates minimum. 
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6.3 Example: The Harmonic Oscillator in a Thermal Bath 



So far, we have supposed that ps is a free particle. However, note that our simplified master 
equation, eqn. 6.7, does not explicitly depend on the system's Hamiltonian (this was contained in 
the first term), so we are free to replace our initial state operator with some other state operator of a 
different system. We choose, due to its utility and familiarity, the harmonic oscillator. From our 
work in section 4.5, we know that state operator for the harmonic oscillator, eqn. 4.67, is 

pn{x,y,t = 0) = - (—J (—J {mcox - d,)" (mcoy - dy) e"^ e"-!' . (6.10) 

If we place this state operator in a thermal bath, we expect the system to evolve approximately 
according to eqn. 6.8, so the time dependent state operator of the harmonic oscillator is 



p„(x, y, = ^ (^)'^' (^)" {rncox - d,)" (mcoy - dy)" e-''f^\-"-ffe-^"'^y>^-'i^-yf*. (6.11) 

In figures 6.1 and 6.2, we plot the state operators for n = Q and n = 3. As is evident from the form of 
eqn. 6.11, the off-diagonal matrix elements (when x + y) quickly vanish with time. Physically, the 
off-diagonal elements of the state operator represent the quantum interference terms, terms that can 
interact only with other quantum systems. These interference terms are what give the entangled 
states we explored in sections 2.4 and 2.5 their interesting qualities. By zeroing the off-diagonal 
elements, we take a quantum mechanical system and force it into a classical distribution. 
As it turns out, this interpretation becomes obvious ast ^ co. By eqn. 6.11, this is 



^In the matrix representation of a state operator, this corresponds to the off-diagonal elements. 
Recall that the totally mixed state in eqn. 2.57 had a diagonal state operator. This confirms that 
decoherence works on the off-diagonal elements of the state operator. 

^When t = td, p(x, y, td) = \p(x, y, t). 

^For example, if we suppose our environment is an ideal, one dimensional gas at room 
temperature with a mass of 10"^* kg per particle and a collision rate with the system of F ^ 10^" 
collisions per second (atmospheric conditions), we find the decoherence time of the system for 

length scales of nanometers to be of order = 2mETkT{x-y)^ ~ 2.io-2'-.io'°°io"23.30o-io-^ ~ ^- 
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lim p{x, y, t) = 

t — >cx> 



if X y, 

ip*{x)ijj{x) = \^{x)^ if X = y. 



(6.12) 



as shown in figure 6.3. This quantity is a statistical probability distribution, and as we saw with the 
roulette wheel at the beginning of this thesis, decoherence has effectively blocked us from 
accessing any of the quantum mechanical information present in our initial system. 



6.4 Concluding Remarks 



We have now developed and applied the master equation for quantum Brownian motion, and 
used it to clarify how a macroscopic, classical object might emerge from quantum mechanics. We 
started by setting the stage with the mathematics and formalism we would need to develop 
quantum mechanics. Then, we used the tools we made to derive the Schrodinger equation and the 
equation of motion for the state operator. 

We then shifted and considered quantum mechanics in phase-space, where the central object is 
the Wigner distribution. Next, we explored some of its key properties and described and example 
of its application using the harmonic oscillator. After that, we used it to derive the simple 
master equation for one-dimensional quantum Brownian motion. We explained each of the 
terms physically, and finally considered an example of decoherence, where the master equation 
transformed a quantum harmonic oscillator into a classical probability distribution. 




Figure 6.3: The final state reached by the first four energy states of the simple harmonic oscillator under the 
simplified master equation 6.11. The two-dimensional plots coincide with the diagonal of the density plots. 
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The debate still rages in the physics commimity; does decoherence theory solve the philosophical 
problems brought about by paradoxes like Schrodinger's cat, or does it merely postpone the 
problem, pushing the fvindamental issue into an environmental black box [28, 29, 30]? Regardless, 
it provides a practical framework for performing objective measurements without an observer, 
which is of key importance to the emerging fields of quantum computation and quantum 
information. Current efforts are underway to probe decoherence directly, both experimentally 
and theoretically. Through the use of mesoscopic systems, scientists have been able to manufacture 
tiny oscillators that are getting very close to the quantum regime [31, 32]. Theoretical predictions 
of what should be observed at the quantum-classical barrier have also been made, with the 
promise of experimental feasibility within a few years [33]. 

Just last year, scientists performed experiments involving ultra-cold chlorophyll, confirming 
that even photosynthesis is a quantum-emergence phenomenon, and thus governed by decoherence 
theory [34]. The group went so far as to suggest that chloroplasts were actually performing 
quantum computation algorithms on themselves to speed-up reaction times. This idea of selective 
self -measurement is intriguing, but largely undeveloped theoretically. It, along with the many 
other application areas of quantum decoherence theory, are sure to occupy physicists for years to 
come. 
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